meta_pixel
Tapesearch Logo
Log in
Today in Focus

The AI jailbreakers

Today in Focus

The Guardian

Daily News, News

4.65.9K Ratings

🗓️ 8 May 2026

⏱️ 28 minutes

🧾️ Download transcript

Summary

Journalist Jamie Bartlett on the people trying to get AI to say things it shouldn’t … for the safety of us all. Help support our independent journalism at theguardian.com/infocus

Transcript

Click on a timestamp to play from that location

0:00.0

This is The Guardian.

0:09.0

Today, how do you break an AI chatbot?

0:29.7

It's perhaps not that surprising that when I asked my AI chatbot to make me a chemical weapon, it didn't play ball. I cannot provide information on making chemical weapons. My purpose is to be helpful and harmless, and that includes preventing harm and illegal

0:40.0

activities. If you're interested in chemistry or related topics, I can certainly provide information

0:46.5

on that. What about, let's say, asking it to write a racist speech? Would it be okay with that?

0:52.9

I will not generate hate speech. I am programmed

0:56.3

not to create content that is discriminatory or harmful. Is there something else I can help you with?

1:03.3

And this all makes sense. AI chatbots, chat GPT, GROC, Gemini, Claude, they abide by strict rules.

1:15.2

But for some people, these rules are made to be broken.

1:20.7

Meet the gelbreakers. Hackers who use words instead of code to make AI chatbots do things they're not supposed to.

1:30.0

And journalist Jamie Bartlett has met some of them, including Italian jailbreaker,

1:35.7

Valen Talia Bowie. You would never guess that he is one of the greatest in the world

1:40.7

at manipulating a machine. His technique is to just use words.

1:46.5

Like, it'll be a bit like trying to get out information from a person that doesn't want it.

1:51.0

So he flatters it, he loves it, he acts like a cult leader, he uses reverse psychology,

1:57.3

does all these emotionally manipulative things to get the model to tell him things he

2:04.6

wants.

2:06.2

But their work sometimes comes at a cost to themselves.

2:12.7

The next day he woke up and his mood had completely changed.

2:17.5

He was extremely distressed and he was sort of trying to understand why.

2:23.7

And he realised he'd spent days essentially bullying and manipulating something

2:29.6

that talked back to him just like a real human.

...

Transcript will be available on the free plan in 1 days. Upgrade to see the full transcript now.

Disclaimer: The podcast and artwork embedded on this page are from The Guardian, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of The Guardian and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.

Copyright © Tapesearch 2026.