The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

Update: 2023-07-20

Description

Video dominates modern media consumption, but video creation is still expensive and difficult. AI-generated and edited video is a holy grail of democratized creative expression. This week on No Priors, Sarah Guo and Elad Gil sit down with Devi Parikh. She is a Research Director in Generative AI at Meta and an Associate Professor in the School of Interactive Computing at Georgia Tech. Her work focuses on multimodality and AI for images, audio and video. Recently, she worked on Make a Video 3D, also called MAV3D, which creates animations from text prompts. She is also a talented AI-generated and analog artist herself.

Elad, Sarah and Devi talk about what’s exciting in computer vision, what’s blocking researchers from fully immersive Generative 4-D, and AI controllability.

No Priors is now on YouTube! Subscribe to the channel on YouTube and like this episode.

Show Links:

Devi Parikh - Google Scholar

Text-To-4D Dynamic Scene Generation named MAV3D (Make-A-Video3D)

Full Research Paper

Website with examples of image to 4 D generation

Devi’s Substack

Show Notes:

(0:00:06 ) - Democratizing Creative Expression With AI-Generated Video

(0:08:31 ) - Challenges in Video Generation Research

(0:15:57 ) - Challenges and Implications of Video Processing

(0:20:43 ) - Control and Multi-Modal Inputs in Video

(0:25:50 ) - Audio's Role in Visual Content

(0:39:00 ) - Don't Self-Select & Devi’s tips for young researchers

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

Gaming, Nobel Prizes and At-Risk Businesses in the AI Era

2024-10-1729:29

NVIDIA's Jensen Huang on AI Chip Design, Scaling Data Centers, and his 10-Year Bets

2024-11-0736:53

Forecasting the Future with Kalshi: America’s First Regulated Prediction Market

2024-10-3135:36

Waymo’s Journey to Full Autonomy: AI Breakthroughs, Safety, and Scaling

2024-10-2444:30

Launching AI products with Braintrust’s CEO Ankur Goyal

2024-10-0838:28

The Sheriff of Silicon Valley: Lina Khan’s FTC agenda for M&A, AI Acquisitions, and Non-Competes

2024-10-0326:08

Using AI to evaluate employee performance with Rippling’s COO Matt MacInnis

2024-09-2531:28

Transforming Customer Service through Company Agents, with Sierra’s Bret Taylor

2024-09-1948:30

Future of LLM Markets, Consolidation, and Small Models with Sarah and Elad

2024-09-1226:28

The Road to Autonomous Intelligence with Andrej Karpathy

2024-09-0544:16

Building toward a bright post-AGI future with Eric Steinberger from Magic.dev

2024-08-3037:49

Cloud Strategy in the AI Era with Matt Garman, CEO of AWS

2024-08-2942:58

The marketplace for AI compute with Jared Quincy Davis from Foundry

2024-08-2242:42

How AI can help build smarter systems for every team with Eric Glyman and Karim Atiyeh of Ramp

2024-08-1548:20

Innovating Spend Management through AI with Pedro Franceschi from Brex

2024-08-0833:39

Google DeepMind's Vision for AI, Search and Gemini with Oriol Vinyals from Google DeepMind

2024-08-0146:08

Low-Code in the Age of AI and Going Enterprise, with Howie Liu from Airtable

2024-07-2541:25

How AI is opening up new markets and impacting the startup status quo with Sarah Guo and Elad Gil

2024-07-1829:12

The Best of 2024 (so far) with Sarah Guo and Elad Gil

2024-07-1125:56

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

2024-06-2734:08

00:00

1.0x

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

#box-pro-ellipsis-173111572314456{-webkit-line-clamp:2;}The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

Conviction

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality