Listen Top Shows Blog

Paul Christiano - Preventing an AI Takeover

Paul Christiano - Preventing an AI Takeover

Update: 2023-10-31

1

Share

Description

Paul Christiano is the world’s leading AI safety researcher. My full episode with him is out!

We discuss:

- Does he regret inventing RLHF, and is alignment necessarily dual-use?

- Why he has relatively modest timelines (40% by 2040, 15% by 2030),

- What do we want post-AGI world to look like (do we want to keep gods enslaved forever)?

- Why he’s leading the push to get to labs develop responsible scaling policies, and what it would take to prevent an AI coup or bioweapon,

- His current research into a new proof system, and how this could solve alignment by explaining model's behavior

- and much more.

Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here. Follow me on Twitter for updates on future episodes.

Open Philanthropy

Open Philanthropy is currently hiring for twenty-two different roles to reduce catastrophic risks from fast-moving advances in AI and biotechnology, including grantmaking, research, and operations.

For more information and to apply, please see the application: https://www.openphilanthropy.org/research/new-roles-on-our-gcr-team/

The deadline to apply is November 9th; make sure to check out those roles before they close.

Timestamps

(00:00:00 ) - What do we want post-AGI world to look like?

(00:24:25 ) - Timelines

(00:45:28 ) - Evolution vs gradient descent

(00:54:53 ) - Misalignment and takeover

(01:17:23 ) - Is alignment dual-use?

(01:31:38 ) - Responsible scaling policies

(01:58:25 ) - Paul’s alignment research

(02:35:01 ) - Will this revolutionize theoretical CS and math?

(02:46:11 ) - How Paul invented RLHF

(02:55:10 ) - Disagreements with Carl Shulman

(03:01:53 ) - Long TSMC but not NVIDIA

Get full access to Dwarkesh Podcast at www.dwarkeshpatel.com/subscribe

Comments

Top Podcasts

The Best New Comedy Podcast Right Now – June 2024 The Best News Podcast Right Now – June 2024 The Best New Business Podcast Right Now – June 2024 The Best New Sports Podcast Right Now – June 2024 The Best New True Crime Podcast Right Now – June 2024 The Best New Joe Rogan Experience Podcast Right Now – June 20 The Best New Dan Bongino Show Podcast Right Now – June 20 The Best New Mark Levin Podcast – June 2024

In Channel

David Reich - How One Small Tribe Conquered the World 70,000 Years Ago

David Reich - How One Small Tribe Conquered the World 70,000 Years Ago

2024-08-2901:56:06

Joe Carlsmith - Otherness and control in the age of AGI

Joe Carlsmith - Otherness and control in the age of AGI

2024-08-2202:30:35

Patrick McKenzie - How a Discord Server Saved Thousands of Lives

Patrick McKenzie - How a Discord Server Saved Thousands of Lives

2024-07-2402:01:34

Tony Blair - Life of a PM, The Deep State, Lee Kuan Yew, & AI's 1914 Moment

Tony Blair - Life of a PM, The Deep State, Lee Kuan Yew, & AI's 1914 Moment

2024-06-2652:52

Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution

2024-06-1101:33:53

Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History

Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History

2024-06-0404:31:18

John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

John Schulman (OpenAI Cofounder) - Reasoning, RLHF, & Plan for 2027 AGI

2024-05-1501:36:29

Mark Zuckerberg - Llama 3, Open Sourcing $10b Models, & Caesar Augustus

Mark Zuckerberg - Llama 3, Open Sourcing $10b Models, & Caesar Augustus

2024-04-1801:17:54

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind

2024-03-2803:12:21

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

Demis Hassabis - Scaling, Superhuman AIs, AlphaZero atop LLMs, Rogue Nations Threat

2024-02-2801:01:33

Patrick Collison (Stripe CEO) - Craft, Beauty, & The Future of Payments

Patrick Collison (Stripe CEO) - Craft, Beauty, & The Future of Payments

2024-02-2101:54:47

Tyler Cowen - Hayek, Keynes, & Smith on AI, Animal Spirits, Anarchy, & Growth

Tyler Cowen - Hayek, Keynes, & Smith on AI, Animal Spirits, Anarchy, & Growth

2024-01-3101:42:22

Lessons from The Years of Lyndon Johnson by Robert Caro [Narration]

Lessons from The Years of Lyndon Johnson by Robert Caro [Narration]

2024-01-2336:32

Will scaling work? [Narration]

Will scaling work? [Narration]

2024-01-1925:43

Jung Chang - Living through Cultural Revolution and the Crimes of Mao

Jung Chang - Living through Cultural Revolution and the Crimes of Mao

2023-11-2901:31:15

Andrew Roberts - SV's Napoleon Cult, Why Hitler Lost WW2, Churchill as Applied Historian

Andrew Roberts - SV's Napoleon Cult, Why Hitler Lost WW2, Churchill as Applied Historian

2023-11-2201:18:49

Dominic Cummings - COVID, Brexit, & Fixing Western Governance

Dominic Cummings - COVID, Brexit, & Fixing Western Governance

2023-11-1502:34:13

Paul Christiano - Preventing an AI Takeover

Paul Christiano - Preventing an AI Takeover

2023-10-3103:07:01

Shane Legg (DeepMind Founder) - 2028 AGI, New Architectures, Aligning Superhuman Models

Shane Legg (DeepMind Founder) - 2028 AGI, New Architectures, Aligning Superhuman Models

2023-10-2644:19

Grant Sanderson (3Blue1Brown) - Past, Present, & Future of Mathematics

Grant Sanderson (3Blue1Brown) - Past, Present, & Future of Mathematics

2023-10-1201:31:20

00:00

00:00

x

Paul Christiano - Preventing an AI Takeover

Paul Christiano - Preventing an AI Takeover

Dwarkesh Patel