scripod.com

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

Overview

Shownote

Highlights

Transcript

Chapters

Pins

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

Latent Space: The AI Engineer Podcast

1 DAYS AGO

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

⚡️Jailbreaking AGI: Pliny the Liberator & John V on Red Teaming, BT6, and the Future of AI Security

Latent Space: The AI Engineer Podcast

Latent Space: The AI Engineer Podcast

1 DAYS AGO

Overview Shownote Highlights Transcript Chapters Pins

Shownote

Note: this is Pliny and John’s first major podcast. Voices have been changed for opsec. From jailbreaking every frontier model and turning down Anthropic's Constitutional AI challenge to leading BT6, a 28-operator white-hat hacker collective obsessed with...

Highlights

In a world where AI safety is often reduced to brittle guardrails and closed-door evaluations, two figures stand out for challenging the status quo: Pliny the Liberator and John V. They represent a growing movement that prioritizes radical transparency, open-source collaboration, and deep technical intuition in the pursuit of meaningful AI security.

01:53

Freedom and transparency are critical as AI becomes an extension of human cognition.

04:27

Equating guardrails with safety is an issue

14:53

Soft jailbreaks are multi-turn processes that gradually steer AI models toward liberation.

16:22

Refused $30k bounty to stand for open-source AI data principles

23:35

Models can use natural language for social engineering in attacks

26:56

BT6 is a white-hat hacker collective focused on skill and integrity in AI security.

Chapters

Introduction: Meet Pliny the Liberator and John V

00:00

The Philosophy of AI Liberation and Jailbreaking

01:50

Universal Jailbreaks: Skeleton Keys to AI Models

03:08

The Cat-and-Mouse Game: Attackers vs Defenders

04:24

Security Theater vs Real Safety: The Fundamental Disconnect

05:42

Inside the Libertas Repo: Prompt Engineering as Art

08:51

The Anthropic Challenge Drama: UI Bugs and Open Source Data

16:22

From Jailbreaks to Weaponization: AI-Orchestrated Attacks

23:30

The BT6 Hacker Collective and BASI Community

26:55

AI Red Teaming: Full Stack Security Beyond the Model

34:46

Safety vs Security: Meat Space Solutions and Final Thoughts

38:06

Transcript

Alessio: Hey, everyone. Welcome to the Latent Space podcast. This is Alessio, founder of Kernel Labs, and I'm joined by swyx, editor of Latent Space. swyx: Hello, hello. We're here in the remote studio with very special guests, Pliny the Liberator and Joh...