Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken
Dwarkesh Podcast
May 22
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken
Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Dwarkesh Podcast
May 22
Shownote
Shownote
New episode with my good friends Sholto Douglas & Trenton Bricken. Sholto focuses on scaling RL and Trenton researches mechanistic interpretability, both at Anthropic. We talk through what’s changed in the last year of AI research; the new RL regime and h...
Highlights
Highlights
In this podcast episode, hosts Sholto Douglas and Trenton Bricken from Anthropic delve into the latest developments in AI research. The discussion covers reinforcement learning's scalability, model interpretability, and the societal implications of artificial general intelligence (AGI). They explore how countries, workers, and students can adapt to the rapid advancements in AI technology.
Chapters
Chapters
How far can RL scale?
00:00Is continual learning a key bottleneck?
16:27Model self-awareness
31:59Taste and slop
50:32How soon to fully autonomous agents?
1:00:51Neuralese
1:15:17Inference compute will bottleneck AGI
1:18:55DeepSeek algorithmic improvements
1:23:01Why are LLMs ‘baby AGI’ but not AlphaZero?
1:37:42Mech interp
1:45:38How countries should prepare for AGI
1:56:15Automating white collar work
2:10:26Advice for students
2:15:35Transcript
Transcript
Dwarkesh Patel: Okay, I'm joined again by my friends, Sholto Douglas. Wait, fuck.
Sholto Douglas: Did I do this last year? No, no, no, you named us differently, but we didn't have. Sholto Douglas and Trenton Bricken.
Dwarkesh Patel: Sholto Douglas and Tr...