Richard Sutton – Father of RL thinks LLMs are a dead end
Dwarkesh Podcast
Sep 26
Richard Sutton – Father of RL thinks LLMs are a dead end
Richard Sutton – Father of RL thinks LLMs are a dead end

Dwarkesh Podcast
Sep 26
Shownote
Shownote
Richard Sutton is the father of reinforcement learning, winner of the 2024 Turing Award, and author of The Bitter Lesson. And he thinks LLMs are a dead end. After interviewing him, my steel man of Richard’s position is this: LLMs aren’t capable of learnin...
Highlights
Highlights
In a thought-provoking conversation, Richard Sutton, a pioneer of reinforcement learning and recipient of the 2024 Turing Award, challenges the prevailing trajectory of AI development, particularly the dominance of large language models. He argues that true intelligence must emerge from experience-driven learning rather than static imitation.
Chapters
Chapters
Are LLMs a dead-end?
00:00Do humans do imitation learning?
13:04The Era of Experience
23:10Current architectures generalize poorly out of distribution
33:39Surprises in the AI field
41:29Will The Bitter Lesson still apply post AGI?
46:41Succession to AIs
53:48Transcript
Transcript
Dwarkesh Patel: Today, I'm chatting with Richard Sutton, who is one of the founding fathers of reinforcement learning and inventor of many of the main techniques used there, like TD learning and policy gradient methods. And for that, he received this year'...