Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Jared Kaplan
FrizzleFried
Follow
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
Alignment faking in large language models
authored
a paper
about 1 year ago
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
authored
a paper
over 1 year ago
Specific versus General Principles for Constitutional AI
View all activity
Organizations
None yet
Papers
5
arxiv:
2412.14093
arxiv:
2401.05566
arxiv:
2310.13798
arxiv:
2308.03296
Expand 5 papers
models
None public yet
datasets
None public yet