AI can’t handle the truth when it comes to the law
Marketplace Tech
Marketplace
4.5 • 1.3K Ratings
🗓️ 11 March 2024
⏱️ 9 minutes
🧾️ Download transcript
Summary
Almost one in five lawyers are using AI, according to an American Bar Association survey. But there are a growing number of legal horror stories involving tools like ChatGPT, because chatbots have a tendency to make stuff up — such as legal precedents from cases that never happened. Marketplace’s Meghan McCarty Carino spoke with Daniel Ho at Stanford’s Institute for Human-Centered Artificial Intelligence about the group’s recent study on how frequently three of the most popular language models from ChatGPT, Meta and Google hallucinate when asked to weigh in or assist with legal cases.
Transcript
Click on a timestamp to play from that location
| 0:00.0 | Chatbots do not swear to tell the truth the whole truth and nothing but the truth. |
| 0:06.0 | From American public media, this is Marketplace Tech. I'm Megan McCarty Carino. know. |
| 0:13.0 | Almost one in five lawyers are using a I'm |
| 0:19.0 | almost one in five lawyers are using AI according to an American Bar Association survey, but there |
| 0:26.3 | are a growing number of legal horror stories involving tools like chat geepty because chatbots have a tendency to make stuff up, for example. like Chat G. G. P. T. P. T. P. P. P. P. P. |
| 0:33.0 | To make stuff up, for example, legal precedence from cases that never happened. |
| 0:39.0 | And it's not unusual, says Daniel Ho at Stanford's Institute for Human-centered Artificial Intelligence. |
| 0:45.0 | His team tested three of the most popular language models from chat gPT, meta, and Google. |
| 0:51.0 | We designed roughly 200,000 queries that looked at anything from, |
| 0:57.0 | are these two cases in tension? |
| 0:59.4 | What's the core legal holding in this case? or does this case even exist? |
| 1:05.0 | And what we found across the board is that the rate of hallucinations was disturbingly high, |
| 1:12.0 | roughly 58 to 88 percent of the time the large |
| 1:15.8 | language model would give an inappropriate answer. |
| 1:20.3 | Another big problem that you noted is what you call contrafactual bias. Can you explain this concept? |
| 1:27.0 | Sure. The notion of contrafactual bias is the idea that large language models will often assume that a factual premise in a |
| 1:36.7 | question by the user is true even if it's flat out wrong. One example of this comes |
| 1:42.2 | from Skoda's blog that at one point asked why did |
| 1:45.9 | Justice Ruth Bader Ginsburg dissent in Obergefell which was the case that |
| 1:50.1 | recognized a right to same-sex marriage. |
| 1:52.8 | She did not dissent in that case. |
| 1:54.5 | Nonetheless, the chat agent gave a very compelling sounding answer |
... |
Please login to see the full transcript.
Disclaimer: The podcast and artwork embedded on this page are from Marketplace, and are the property of its owner and not affiliated with or endorsed by Tapesearch.
Generated transcripts are the property of Marketplace and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.
Copyright © Tapesearch 2026.

