Effective Altruism Forum
AI Safety Newsletter
EA Forum

AI Safety Newsletter

56

AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media

· 3y ago · 4m read

35

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

· 3y ago · 5m read

34

AI Safety Newsletter #4: AI and Cybersecurity, Persuasive AIs, Weaponization, and Geoffrey Hinton talks AI risks

Center for AI Safety

· 3y ago · 6m read

60

AI Safety Newsletter #5: Geoffrey Hinton speaks out on AI risk, the White House meets with AI labs, and Trojan attacks on language models

Center for AI Safety

· 3y ago · 5m read

32

AI Safety Newsletter #6: Examples of AI safety progress, Yoshua Bengio proposes a ban on AI agents, and lessons from nuclear arms control

Center for AI Safety

· 3y ago · 7m read

23

AI Safety Newsletter #7: Disinformation, Governance Recommendations for AI labs, and Senate Hearings on AI

Center for AI Safety

· 3y ago · 8m read

16

AI Safety Newsletter #8: Rogue AIs, how to screen for AI risks, and grants for research on democratic governance of AI

Center for AI Safety

· 3y ago · 7m read

12

AISN #9: Statement on Extinction Risks, Competitive Pressures, and When Will AI Reach Human-Level?

Center for AI Safety

· 3y ago · 9m read

30

AISN #12: Policy Proposals from NTIA’s Request for Comment and Reconsidering Instrumental Convergence

Center for AI Safety

· 2y ago · 8m read

25

AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave

Center for AI Safety

· 2y ago · 10m read

26

AISN#14: OpenAI’s ‘Superalignment’ team, Musk’s xAI launches, and developments in military AI use

Center for AI Safety

· 2y ago · 5m read

3

AISN #16: White House Secures Voluntary Commitments from Leading AI Labs and Lessons from Oppenheimer

Center for AI Safety

· 2y ago · 7m read

15

AISN #17: Automatically Circumventing LLM Guardrails, the Frontier Model Forum, and Senate Hearing on AI Oversight

Center for AI Safety

· 2y ago · 9m read