DiscoverSuper Data Science: ML & AI Podcast with Jon Krohn711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain
711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain

711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain

Update: 2023-09-05
Share

Description

In this episode, host Jon Krohn explores with his guest Ajay Jain, Co-Founder of Genmo.ai, how creative general intelligence could take the video industry by storm. They also discuss the models that got Genmo to this point, the applications of NeRF, and how understanding human psychology is so essential to developing models that output high-fidelity video.

This episode is brought to you by the Zerve data science dev environment (https://zerve.ai), by Grafbase (https://grafbase.com), the unified data layer, and by Modelbit (https://modelbit.com), for deploying models in seconds. Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.

In this episode you will learn:
• About Genmo.ai and the term “creative general intelligence” [03:47 ]
• Why Ajay started Genmo.ai [09:26 ]
• The increased performance of multimodal models [21:12 ]
• All about Denoising Diffusion Probabilistic Models (DDPMs) [31:03 ]
• The application of Neural Radiance Fields (NeRF) [55:26 ]
• Predicting pedestrian behavior at Uber [1:01:50 ]
• How to save money in the process of training models [1:12:42 ]

Additional materials: www.superdatascience.com/711
Comments 
loading
In Channel
loading
00:00
00:00
1.0x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain

711: Image, Video and 3D-Model Generation from Natural Language, with Dr. Ajay Jain

Jon Krohn