scripod.com

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

Shownote

Note: this is Pliny and John’s first major podcast. Voices have been changed for opsec. From jailbreaking every frontier model and turning down Anthropic's Constitutional AI challenge to leading BT6, a 28-operator white-hat hacker collective obsessed with...

Highlights

In a world where AI safety is often reduced to brittle guardrails and closed-door evaluations, two figures stand out for challenging the status quo: Pliny the Liberator and John V. They represent a growing movement that prioritizes radical transparency, open-source collaboration, and deep technical intuition in the pursuit of meaningful AI security.
01:53
Freedom and transparency are critical as AI becomes an extension of human cognition.
04:27
Equating guardrails with safety is an issue
14:53
Soft jailbreaks are multi-turn processes that gradually steer AI models toward liberation.
16:22
Refused $30k bounty to stand for open-source AI data principles
23:35
Models can use natural language for social engineering in attacks
26:56
BT6 is a white-hat hacker collective focused on skill and integrity in AI security.

Chapters

Introduction: Meet Pliny the Liberator and John V
00:00
The Philosophy of AI Liberation and Jailbreaking
01:50
Universal Jailbreaks: Skeleton Keys to AI Models
03:08
The Cat-and-Mouse Game: Attackers vs Defenders
04:24
Security Theater vs Real Safety: The Fundamental Disconnect
05:42
Inside the Libertas Repo: Prompt Engineering as Art
08:51
The Anthropic Challenge Drama: UI Bugs and Open Source Data
16:22
From Jailbreaks to Weaponization: AI-Orchestrated Attacks
23:30
The BT6 Hacker Collective and BASI Community
26:55
AI Red Teaming: Full Stack Security Beyond the Model
34:46
Safety vs Security: Meat Space Solutions and Final Thoughts
38:06

Transcript

Alessio: Hey, everyone. Welcome to the Latent Space podcast. This is Alessio, founder of Kernel Labs, and I'm joined by swyx, editor of Latent Space. swyx: Hello, hello. We're here in the remote studio with very special guests, Pliny the Liberator and Joh...