Effective Altruism Forum
AI alignment
• Applied to: What do you mean with ‘alignment is solvable in principle’? (10h ago)
• Applied to: How do fictional stories illustrate AI misalignment? (2d ago)
• Applied to: Implications of the inference scaling paradigm for AI safety (2d ago)
• Applied to: Join the AI Alignment Evals hackathon (3d ago)
• Applied to: Our new video about goal misgeneralization, plus an apology (3d ago)
• Applied to: The moral argument for giving AIs autonomy (9d ago)
• Applied to: Turing-Test-Passing AI implies Aligned AI (17d ago)
• Applied to: What predictions from theoretical AI Safety research have been confirmed by empirical work? (19d ago)
• Applied to: What Areas of AI Safety and Alignment Research are Largely Ignored? (21d ago)
• Applied to: Takes on "Alignment Faking in Large Language Models" (1mo ago)
• Applied to: Alignment Faking in Large Language Models (1mo ago)
• Applied to: What is "wireheading"? (1mo ago)
• Applied to: A Five-Year Plan to Ensure AGI Benefits All Animals (1mo ago)
• Applied to: Is RLHF cruel to AI? (1mo ago)
• Applied to: Developing a Calculable Conscience for AI: Equation for Rights Violations (1mo ago)
• Applied to: The Dissolution of AI Safety (1mo ago)
• Applied to: Frontier AI systems have surpassed the self-replicating red line (1mo ago)
• Applied to: Cosmic AI safety (1mo ago)
• Applied to: OpenAI's o1 tried to avoid being shut down, and lied about it, in evals (1mo ago)