This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
Effective Altruism Forum
EA Forum
Login
Sign up
Discovering Language Model Behaviors with Model-Written
Evaluations
by
evhub
Dec 20 2022
1 min read
0
25
AI safety
AI alignment
AI risk
Research summary
Frontpage
Reactions
0
0
Comments
Comment
No comments on this post yet.
Be the first to respond.
More from
evhub
View more
Curated and popular this week
Relevant opportunities
View more