Latent Space: The AI Engineer Podcast

World Models & General Intuition: Khosla's largest bet since LLMs & OpenAI

Update: 2025-12-06

Description

From building Medal into a 12M-user game clipping platform with 3.8B highlight moments to turning down a reported $500M offer from OpenAI (https://www.theinformation.com/articles/openai-offered-pay-500-million-startup-videogame-data) and raising a $134M seed from Khosla (https://techcrunch.com/2025/10/16/general-intuition-lands-134m-seed-to-teach-agents-spatial-reasoning-using-video-game-clips/) to spin out General Intuition, Pim is betting that world models trained on peak human gameplay are the next frontier after LLMs.

We sat down with Pim to dig into why game highlights are “episodic memory for simulation” (and how Medal’s privacy-first action labels became a world-model goldmine: https://medal.tv/blog/posts/enabling-state-of-the-art-security-and-protections-on-medals-new-apm-and-controller-overlay-features), what it takes to build fully vision-based agents that see only frames and output actions in real time, how General Intuition transfers from games to real-world video and then into robotics, and why world models and LLMs are complementary rather than rivals. We also cover what founders with proprietary datasets should know before selling or licensing to labs, and his bet that spatial-temporal foundation models will power 80% of future atoms-to-atoms interactions in both simulation and the real world.

We discuss:

  • How Medal’s 3.8B action-labeled highlight clips became a privacy-preserving goldmine for world models

  • Building fully vision-based agents that only see frames and output actions yet play like (and sometimes better than) humans

  • Transferring from arcade-style games to realistic games to real-world video using the same perception–action recipe

  • Why world models need actions, memory, and partial observability (smoke, occlusion, camera shake) vs. “just” pretty video generation

  • Distilling giant policies into tiny real-time models that still navigate, hide, and peek corners like real players

  • Pim’s path from RuneScape private servers, Tourette’s, and reverse engineering to leading a frontier world-model lab

  • How data-rich founders should think about valuing their datasets, negotiating with big labs, and deciding when to go independent

  • GI’s first customers: replacing brittle behavior trees in games, engines, and controller-based robots with a “frames in, actions out” API

  • Using Medal clips as “episodic memory of simulation” to move from imitation learning to RL via world models and negative events

  • The 2030 vision: spatial–temporal foundation models that power the majority of atoms-to-atoms interactions in simulation and the real world

Chapters

  • 00:00:00 Introduction and Medal's Gaming Data Advantage
  • 00:02:08 Exclusive Demo: Vision-Based Gaming Agents
  • 00:06:17 Action Prediction and Real-World Video Transfer
  • 00:08:41 World Models: Interactive Video Generation
  • 00:13:42 From RuneScape to AI: Pim's Founder Journey
  • 00:16:45 The Research Foundations: Diamond, Genie, and SEMA
  • 00:33:03 Vinod Khosla's Largest Seed Bet Since OpenAI
  • 00:35:04 Data Moats and Why GI Stayed Independent
  • 00:38:42 Self-Teaching AI Fundamentals: The Francois Fleuret Course
  • 00:40:28 Defining World Models vs Video Generation
  • 00:41:52 Why Simulation Complexity Favors World Models
  • 00:43:30 World Labs, Yann LeCun, and the Spatial Intelligence Race
  • 00:50:08 Business Model: APIs, Agents, and Game Developer Partnerships
  • 00:58:57 From Imitation Learning to RL: Making Clips Playable
  • 01:00:15 Open Research, Academic Partnerships, and Hiring
  • 01:02:09 2030 Vision: 80 Percent of Atoms-to-Atoms AI Interactions
