Pam Samuelson on Copyright's Threat to Generative AI

The Lawfare Podcast

The Lawfare Institute

International Law, Law, Government, Foreign Policy, News, Politics, Rule Of Law, International Relations, Current Events, Military, Constitutional Law, Intelligence, National Security, History, Terrorism, Diplomacy

4.7 • 6.4K Ratings

🗓️ 17 July 2023

⏱️ 37 minutes

🔗️ Recording | iTunes | RSS

🧾️ Download transcript

Summary

The only thing more impressive than the performance of generative AI systems like GPT-4 and Stable Diffusion is the sheer volume of training data that went into these systems. GPT was reportedly trained on, essentially, the entire Internet, while Stable Diffusion and other image-generation models rely on hundred of millions if not billions of existing pieces of artwork. Of course, much of this content is copyrighted, and the authors and artists whose work is being used to train these models and, potentially, threaten their own livelihoods are paying attention. A number of high-profile lawsuits are making their way through the courts, and the outcome of these cases could hugely shape, and potentially even stop, progress in machine learning.

To explore these issues, Alan Rozenshtein, Associate Professor of Law at the University of Minnesota and Senior Editor at Lawfare, spoke with Pam Samuelson, the Richard M. Sherman Distinguished Professor of Law at the University of California at Berkeley and one of the pioneers in the study of digital copyright law. She's just published a new piece in the journal Science titled "Generative AI meets copyright,” in which she analyzes the current litigation around generative AI and where it might lead.

Support this show http://supporter.acast.com/lawfare.

Hosted on Acast. See acast.com/privacy for more information.

Transcript

Click on a timestamp to play from that location

0:00.0	The following podcast contains advertising.
0:04.0	To access an ad-free version of the LawFair podcast,
0:08.0	become a material supporter of LawFair at patreon.com slash law fair.
0:14.0	That's patreon.com slash law fair.
0:18.0	Also, check out LawFair's other podcast offerings,
0:22.0	rational security, chatter, law fair no bull, and the aftermath.
0:29.0	What copyright law protects is the exploitation of expression.
0:39.0	From the standpoint of the technologist, this looks like what we're doing is very much like what Google did.
0:47.0	Google was doing it from research library collections.
0:51.0	We're doing it from the open internet.
0:53.0	And web scraping is just something that people do all the time.
0:57.0	And therefore, it must be fair use because it's been allowed for years and years.
1:03.0	I'm Alan Rosenstein, associate professor of law at the University of Minnesota and senior editor at LawFair.
1:09.0	And this is the LawFair podcast for July 17, 2023.
1:14.0	The only thing more impressive than the performance of generative AI systems like GPT-4,
1:19.0	and stable diffusion is just the sheer volume of training data that goes into these systems.
1:25.0	GPT was trained on reportedly what is essentially the entire internet,
1:29.0	while stable diffusion and other image generation models rely on hundreds of millions,
1:33.0	if not billions, of existing pieces of artwork.
1:37.0	Of course, much of this content is copyrighted,
1:39.0	and the authors and artists whose work is being used to train these models,
1:43.0	and potentially to threaten their own livelihoods, their paying attention.
	...

Please login to see the full transcript.

Previous episode | Next episode

Disclaimer: The podcast and artwork embedded on this page are from The Lawfare Institute, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of The Lawfare Institute and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.