meta_pixel
Tapesearch Logo
Log in
Marketplace Tech

The human labor behind AI chatbots and other smart tools

Marketplace Tech

Marketplace

News, Technology

4.51.3K Ratings

🗓️ 21 March 2023

⏱️ 10 minutes

🧾️ Download transcript

Summary

Every week it seems the world is stunned by another advance in artificial intelligence, including text-to-image generators like DALL-E and the latest chatbot, GPT-4. What makes these tools impressive is the enormous amount of data they’re trained on, specifically the millions of images and words on the internet. But the process of machine learning relies on a lot of human data labelers. Marketplace’s Meghan McCarty Carino spoke to Sarah Roberts, a professor of information studies and director of the Center for Critical Internet Inquiry at UCLA, about how this work is often overlooked.

Transcript

Click on a timestamp to play from that location

0:00.0

Behind every superstar chatbot, there's a lot of unsung human labor.

0:07.6

From American public media, this is Marketplace Tech.

0:10.4

I'm Megan McCarty-Karino.

0:21.4

Every week, it seems, the world is stunned by another advance in artificial intelligence.

0:27.5

Text to image, generators like Dolly, to the latest chatbot, GPT-4.

0:33.9

What makes these tools impressive is the enormous amount of data they're trained on, like all

0:40.3

the images and text on the internet.

0:43.3

But the process of machine learning relies on a lot of human data laborers.

0:49.8

It's worked that often gets overlooked, says Sarah Roberts.

0:53.2

He's a professor of information studies and director of the Center for Critical Internet

0:57.5

Inquiry at UCLA.

0:59.7

In the case of something like chat GPT and the engines that it's using, it's really going

1:04.6

out and pretty much data mining, massive portions of the internet.

1:10.5

Now we all know that the internet is filled with the best information and the greatest stuff

1:15.2

all the time, right?

1:16.8

So what's required for something like that is to have human beings with their special

1:22.2

ability of discernment and good judgment and sometimes visceral reactions to material

1:28.6

and in the case of chat GPT, to coal material out, material that users or more importantly

1:35.1

companies would not want inside of their products as a potential output.

1:40.0

And so that means these data laborers, much like content moderators, spend their days

1:44.7

working on some of the worst stuff that we can imagine.

1:48.1

And in this case, they're trying to build models to coal that out automatically.

...

Please login to see the full transcript.

Disclaimer: The podcast and artwork embedded on this page are from Marketplace, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of Marketplace and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.

Copyright © Tapesearch 2026.