#547: Parallel Python at Anyscale with Ray

Talk Python To Me

Michael Kennedy

Technology

4.8 • 642 Ratings

🗓️ 6 May 2026

⏱️ 59 minutes

🔗️ Recording | iTunes | RSS

🧾️ Download transcript

Summary

When OpenAI trained GPT-3, they didn't roll their own orchestration layer. They used Ray, an open source Python framework born out of the same Berkeley research lab lineage that gave us Apache Spark. And here's the twist: Ray was originally built for reinforcement learning research, then quietly faded as RL hit a wall. Until ChatGPT showed up. Suddenly reinforcement learning was back, as the post-training step that turns a raw language model into something genuinely useful. Edward Oakes and Richard Liaw, two founding engineers behind Ray and Anyscale, join me on Talk Python to tell that story. We'll trace Ray from its RISE Lab origins at UC Berkeley to powering some of the largest training runs in the world. We'll talk about what Ray actually is, a distributed execution engine for AI workloads, and how a few lines of Python become work running across hundreds of GPUs. We'll cover Ray Data for multimodal pipelines, the dashboard, the VS Code remote debugger, KubRay for Kubernetes, and where Ray fits alongside Dask, multiprocessing, and asyncio. If you've ever stared at a single-machine Python script and thought, "there has to be a better way to scale this", this one's for you

Transcript

Click on a timestamp to play from that location

0:00.0	When Open AI trained GPT3, they didn't roll their own orchestration layer.
0:04.9	They used Ray, an open source Python framework born out of the same Berkeley Research Lab lineage
0:10.3	that gave us Apache Spark. And here's the twist. Ray was originally built for reinforcement
0:15.7	learning research and then quietly faded as RL hit a wall. Until ChatTPT showed up, suddenly reinforcement learning was back,
0:24.2	as the post-training step that turns a raw language model into something genuinely useful.
0:29.1	Edward Oakes and Richard Law, two founding engineers behind Ray and AnyScale,
0:33.4	joined me on Talk Python to tell that story.
0:36.4	We'll trace Ray from its RISE lab origins at UC Berkeley to powering some of the largest
0:41.5	training runs in the world.
0:43.3	We'll talk about what Ray actually is, a distributed execution engine for AI workloads,
0:47.8	and how a few lines of Python become work running across hundreds of GPUs.
0:52.6	We'll cover Ray data
0:54.3	for multimodal pipelines, the dashboard,
0:57.0	the VS code remote debugger,
0:59.0	Cube Ray for Kubernetes, and where Ray fits
1:01.5	alongside Dask, multi-processing, and async.
1:05.1	If you've ever stared at a single machine
1:07.3	Python script and thought, there has to be a better way to
1:09.9	scale this. This one's for you. It's Talk Python and thought, there has to be a better way to scale this.
1:11.5	This one's for you.
1:13.2	It's Talk Python and Me.
1:18.0	Episode 547 recorded April 27, 2006.
	...

Please login to see the full transcript.

Previous episode | Next episode

Disclaimer: The podcast and artwork embedded on this page are from Michael Kennedy, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of Michael Kennedy and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.