AI’s New Training Data: Your Old Work Slacks And Emails
Forbes Daily Briefing
Forbes
🗓️ 22 April 2026
⏱️ 6 minutes
Transcript
| 0:00.0 | Today on Forbes: AI's new training data, your old work Slacks and emails. |
| 0:07.0 | When Shanna Johnson was winding down Cielo24, the transcription and captioning company she ran as CEO, she discovered an unexpected asset: |
| 0:18.0 | its operational exhaust, that is, the digital leftovers that |
| 0:23.0 | pile up across years of work and collaboration. To close the company out, she worked with |
| 0:29.3 | SimpleClosure, a startup that specializes in helping companies wind down. SimpleClosure helped her |
| 0:35.9 | through the usual shutdown paperwork, closing out payroll and taxes, |
| 0:40.5 | getting investor consents in order, and filing paperwork with the IRS. Then came the part nobody |
| 0:47.0 | puts in the founder playbook: selling off Cielo24's 13-year digital footprint. Every Slack joke, every Jira ticket, emails documenting |
| 0:57.2 | internal victories or frustrations, sitting in employees' multi-terabyte Google Drives, |
| 1:03.2 | selling all that off as training data for the next generation of AI. For that, Cielo24 received, quote, hundreds of thousands of dollars, |
| 1:13.7 | which Johnson said helped her go from, quote, I don't know how we are going to pay our bills, |
| 1:18.9 | to, quote, we can tie this up neatly with a bow and be able to walk away. She told Forbes, quote, |
| 1:26.0 | I'm still a bit emotional about shutting the company down, |
| 1:28.5 | but it's cool to think that our data could be useful, live on, and help other people. |
| 1:34.8 | It's a clean ending for a messy reality. |
| 1:38.2 | The company didn't survive, but its work trail did. |
| 1:42.2 | And in 2026, that trail can be worth real money. |
| 1:46.6 | Johnson's data sale isn't an isolated exit strategy. |
| 1:50.0 | It is a new frontier in the AI arms race. |
| 1:54.2 | AI labs started off by training their models on the public internet, |
| 1:58.5 | Reddit threads, Wikipedia entries, digitized books. |
| 2:02.3 | But they exhausted that, all of it, by late 2024, according to former OpenAI chief scientist |
... |

