meta_pixel
Tapesearch Logo
Log in
The a16z Show

The Frontier of Spatial Intelligence with Fei-Fei Li

The a16z Show

a16z

Culture, Business, Science, Disruption, Technology, Software Eating The World, Entrepreneurship, Innovation

4.21.2K Ratings

🗓️ 13 November 2025

⏱️ 44 minutes

🧾️ Download transcript

Summary

Fei-Fei Li and Justin Johnson are pioneers in AI. While the world has only recently witnessed a surge in consumer AI, they have long been laying the groundwork for the innovations transforming industries today. With the recent launch of Marble, the first product from their company World Labs, we are revisiting this conversation to explore the ideas that started it all. World Labs is focused on spatial intelligence, building Large World Models that can perceive, generate, and interact with the 3D world. Marble brings that vision to life, allowing anyone, from individual creators to major platforms, to generate 3D scenes directly from text or image prompts and turn complex 3D creation into a simple, creative process. In this episode, a16z general partner Martin Casado talks with Fei-Fei and Justin about the journey from early AI winters to the rise of deep learning and multimodal AI. From foundational breakthroughs like ImageNet to the cutting-edge realm of spatial intelligence, they discuss the evolution of the field and what is next for innovation at World Labs.

Transcript

Click on a timestamp to play from that location

0:00.0

This is fundamentally philosophically to be a different problem.

0:03.9

The previous decade had mostly been about understanding data that already exists,

0:08.8

but the next decade was going to be about understanding new data.

0:12.0

Visual, spatial intelligence is so fundamental. It's as fundamental as language.

0:19.0

It's like unwrapping presents on Christmas, that every day you know there's going to be some amazing new discovery, some amazing new application or algorithm somewhere.

0:26.7

If we see something or if we imagine something, both can converge towards generating it.

0:35.2

I think we're in the middle of a Cambrian explosion.

0:38.3

The next chapter of AI isn't about better language models.

0:43.3

It's about understanding the 3D world as fundamentally as we understand text.

0:48.3

Recently, World Labs launched Marble, their first product,

0:51.3

so we're replaying our most popular conversation to date, a discussion with

0:55.4

World Labs co-founders Faye-Lee and Justin Johnson about why spatial intelligence is the missing

1:00.5

piece for truly intelligent machines. Together with A16Z general partner, Martin Casato,

1:06.4

Fei-Fei and Justin talk about how ImageNet's million image bet in 2009 unlocked modern computer vision,

1:12.9

why today's multimodal models are still trapped in one dimension despite processing pixels

1:17.5

and how their team is building the infrastructure to generate fully interactive 3D worlds as

1:22.4

easily as we generate texts today. From the convergence of reconstruction and generation

1:27.1

that's redefining computer

1:28.4

vision to why AR, VR, and robotics desperately need native 3D understanding, this is the story

1:35.0

of four legendary researchers betting everything that the path to AGI runs through spatial intelligence.

1:40.9

Let's get into it. Over the last two years, we've seen this kind of massive rush of consumer AI companies and technology, and it's been quite wild.

1:51.0

But you've been doing this now for decades.

...

Please login to see the full transcript.

Disclaimer: The podcast and artwork embedded on this page are from a16z, and are the property of its owner and not affiliated with or endorsed by Tapesearch.

Generated transcripts are the property of a16z and are distributed freely under the Fair Use doctrine. Transcripts generated by Tapesearch are not guaranteed to be accurate.

Copyright © Tapesearch 2026.