
Bringing Whisper and LLaMA to the masses (Interview)


Update: 2023-03-22


Description

This week we’re talking with Georgi Gerganov about his work on whisper.cpp and llama.cpp. Georgi first crossed our radar with whisper.cpp, his port of OpenAI’s Whisper model in C and C++. Whisper is a speech recognition model enabling audio transcription and translation, something we’re paying close attention to here at Changelog, for obvious reasons. Between the invite and the show’s recording, he had a new hit project on his hands: llama.cpp. This is a port of Facebook’s LLaMA model in C and C++. Whisper.cpp made a splash, but llama.cpp is growing in GitHub stars faster than Stable Diffusion did, which was a rocket ship itself.


Changelog++ members get a bonus 12 minutes at the end of this episode and zero ads. Join today!

Sponsors:

Postman – Build APIs together — More than 20 million developers use Postman for building and using APIs. Postman simplifies each step of the API lifecycle and streamlines collaboration so you can create better APIs—faster.

Sentry – Session Replay! Rewind and replay every step of the user’s journey before and after they encountered an issue. Eliminate the guesswork and get to the root cause of an issue, faster. Use the code CHANGELOG and get the team plan free for three months.

Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com

Typesense – Lightning fast, globally distributed Search-as-a-Service that runs in memory. You literally can’t get any faster!

Featuring:

Georgi Gerganov – Mastodon, Twitter, GitHub, Website
Adam Stacoviak – Mastodon, Twitter, GitHub, LinkedIn, Website
Jerod Santo – Mastodon, Twitter, GitHub, LinkedIn

Show Notes:

ggerganov/whisper.cpp

examples/main

Arm Neon technology

Apple’s secret M1 coprocessor

ggerganov/llama.cpp

Introducing LLaMA: A foundational, 65-billion-parameter large language model

facebookresearch/llama

Ludacris Llama Llama Red Pajama Freestyle

The Changelog #506: Stable Diffusion breaks the internet with Simon Willison

Large language models are having their Stable Diffusion moment



