740: Local AI Models in JavaScript - Machine Learning Deep Dive With Xenova
Update: 2024-03-08
Description
Scott and Wes are joined by special guest Xenova to explore local AI models in JavaScript. From Hugging Face to Transformers.js and practical applications like real-time speech recognition and object detection, this episode dives deep into the world of machine learning.
Show Notes
- 00:00 Welcome to Syntax!
- 00:41 Brought to you by Sentry.io
- 01:05 Who is Xenova?
- 02:08 What is Hugging Face?
- 03:29 What is Transformers.js?
- 06:16 How was the library developed?
- SponsorBlock
- 09:04 How is it able to run?
- 10:09 Do they have to run in Python and how does Onnx work?
- Onnx.ai
- Hugging Face Optimum
- 14:19 What are some things you can do with this tech?
- 16:15 Vision tools.
- 17:38 This is actually running locally.
- 18:35 Doodle Dash
- 21:09 They currently run on CPU, what is required to make it run on GPU?
- 24:44 Can you run in JavaScript?
- 28:32 How it works with image vectors.
- 34:23 Why would people want to run it in another language?
- 35:55 Resizing images in the browser instead of on the server.
- 38:55 Applications distributed on the web vs running locally.
- 43:54 Electron has Node and Chrome, where would you run Transformers.js?
- 44:32 The API of Transformers.js
- 46:30 Object Detection.
- Semantic Image Search Client
- Real-Time Object Detection
- Background Removal Tool
- 48:33 What is the easiest way to get started?
- 51:26 Real-time speech recognition on the horizon?
- 52:08 Will we ever be able to run Stable Diffusion via JavaScript?
- 56:10 The Web LLM.
- 57:22 Practical applications for YouTube.
- 59:39 What we want to build for Syntax.fm.
- 01:06:43 Mean pooling, why it’s necessary.
- 01:09:30 Stopping YouTube spam comments.
- 01:10:34 K-Means Clustering.
- Text Clustering
- 01:13:49 Quantization.
- 01:17:35 Sick Picks + Shameless Plugs.
Sick Picks
- Xenova: WebGPU
Shameless Plugs
- Xenova: Xenova on X
Hit us up on Socials!
Syntax: X Instagram Tiktok LinkedIn Threads
Wes: X Instagram Tiktok LinkedIn Threads