Paul Christiano - How Misalignment Could Lead to Takeover

"How Misalignment Could Lead to Takeover" by Paul Christiano. Delivered at the 2023 San Francisco Alignment Workshop.
