Chatbots Don't Know What Stuff Isn't

The Quanta Podcast

Quanta Magazine

Physics, Life Sciences, Science

4.7 • 643 Ratings

🗓️ 13 September 2023

⏱️ 17 minutes

🔗️ Recording | iTunes | RSS

🧾️ Download transcript

Summary

Today’s language models are more sophisticated than ever, but they still struggle with the concept of negation. That’s unlikely to change anytime soon. Read more at QuantaMagazine.org. Music is “Hidden Agenda” by Kevin MacLeod.

Transcript

Click on a timestamp to play from that location

0:00.0	Welcome to Quantum Magazine's podcast.
0:10.0	Each episode, we bring you stories about developments in science and mathematics.
0:14.0	I'm Susan Vallett.
0:16.0	In October of 2018, Google released a language model algorithm called Burt. It was Google's first language
0:24.4	model that was self-taught on a massive volume of online data, but it still struggled
0:30.1	with one thing. That's next. Quantum Magazine is an editorially independent online publication supported by the Simons Foundation
0:42.3	to enhance public understanding of science.
0:49.3	When Google released Burt, Nora Kastner, a language model researcher, quickly loaded it on her
0:57.4	laptop. Like her peers, Kastner was impressed that Burt could complete user's sentences and answer
1:03.8	simple questions. It seemed as if the large language model, or LLM, could read text at least as well as a human. But Kastner, who at the time
1:14.9	was a graduate student at Ludwig Maximilian University of Munich, remained skeptical. She felt
1:21.5	LLMs should understand what their answers mean and what they don't mean. It's one thing to know that a bird can fly.
1:29.7	Kastner says a model should automatically also know that the negated statement, a bird cannot fly,
1:36.4	is false. But when she and her advisor, Henrik Chutzh, tested Bert and two other LLMs in 2019, they found that the models behaved as
1:47.8	if words like not were invisible.
1:51.6	Since then, LLMs have skyrocketed in size and ability.
1:55.9	Ding Xiao leads the safe artificial intelligence lab at Carnegie Mellon University.
2:01.4	The algorithm itself is still similar to what we have before.
2:05.6	However, the scale and the performance is really astonishing.
2:09.6	But while chatbots have improved their human-like performances,
2:13.8	they still have trouble with negation.
2:16.3	They know what it means if a bird can't fly, but they
	...

Please login to see the full transcript.

Previous episode | Next episode

Disclaimer: The podcast and artwork embedded on this page are from Quanta Magazine, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of Quanta Magazine and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.