#516: Accelerating Python Data Science at NVIDIA

Talk Python To Me

Michael Kennedy

Technology

4.8 • 642 Ratings

🗓️ 19 August 2025

⏱️ 66 minutes

🔗️ Recording | iTunes | RSS

🧾️ Download transcript

Summary

Python’s data stack is getting a serious GPU turbo boost. In this episode, Ben Zaitlen from NVIDIA joins us to unpack RAPIDS, the open source toolkit that lets pandas, scikit-learn, Spark, Polars, and even NetworkX execute on GPUs. We trace the project’s origin and why NVIDIA built it in the open, then dig into the pieces that matter in practice: cuDF for DataFrames, cuML for ML, cuGraph for graphs, cuXfilter for dashboards, and friends like cuSpatial and cuSignal. We talk real speedups, how the pandas accelerator works without a rewrite, and what becomes possible when jobs that used to take hours finish in minutes. You’ll hear strategies for datasets bigger than GPU memory, scaling out with Dask or Ray, Spark acceleration, and the growing role of vector search with cuVS for AI workloads. If you know the CPU tools, this is your on- ramp to the same APIs at GPU speed.

Transcript

Click on a timestamp to play from that location

0:00.0	Python's data stack is getting a serious GPU turbo boost. In this episode, Ben Zatlin from
0:05.5	NVIDIA joins us to unpack Rapids, the open source toolkit that lets Pandas, PsyKit Learn, Spark,
0:12.0	pullers, and even Network X execute on GPUs. We trace the project's origins and why
0:17.7	NVIDIA built it in the open, Then dig into the pieces that matter in practice.
0:22.3	QDF for data frames, QML for machine learning,
0:25.7	Q graph for graphs, QX filter for dashboards and friends like Q spatial and Q signal.
0:31.1	We talk real speedups, how the Pandas accelerator works without a rewrite,
0:35.4	and what becomes possible when jobs that used to take hours finish in minutes.
0:40.3	You'll hear strategies for datasets bigger than GPU memory, scaling out with Dasqueray,
0:45.5	Spark acceleration, and the growing role of vector search with QVS for AI workloads.
0:51.1	If you know the CPU tools, this is your on-ramp to the same APIs at GPU speed.
0:56.7	This is Talk Python to Me.
0:58.0	Episode 516 recorded July 1st, 2025.
1:01.7	Yeah.
1:03.0	Talk Python to me.
1:05.0	Yeah, we ready to roll.
1:06.0	Upgrading the code.
1:07.0	No fear of getting old.
1:09.0	They sink in the air.
1:10.0	New frameworks in sight.
1:11.8	Geeky rap on deck.
1:13.1	Quark crew.
	...

Please login to see the full transcript.

Previous episode | Next episode

Disclaimer: The podcast and artwork embedded on this page are from Michael Kennedy, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of Michael Kennedy and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.