Discover
"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis
Author: Erik Torenberg, Nathan Labenz
Subscribed: 130Played: 1,934Subscribe
Share
© 2023
Description
A biweekly podcast where hosts Nathan Labenz and Erik Torenberg interview the builders on the edge of AI and explore the dramatic shift it will unlock in the coming years.
The Cognitive Revolution is part of the Turpentine podcast network. To learn more: turpentine.co
64 Episodes
Reverse
This 20-min episode comes from our friend Nathaniel Whittemore's excellent daily podcast The AI Breakdown Podcast. This episode aired on June 1, 2023, and covers the latest developments from OpenAI, including new features, a cybersecurity grant program, and their new process rewards model for trading. We hope you enjoy it as much as we do.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
Subscribe to The AI Breakdown podcast: https://pod.link/1680633614
Subscribe to The AI Breakdown on YouTube: https://www.youtube.com/@TheAIBreakdown
In this episode, Nathan sits down with three researchers at Carnegie Mellon studying adversarial attacks and mimetic initialization: Zico Kolter, Andy Zou, and Asher Trockman. They discuss: the motivation behind researching universal adversarial attacks on language models, how the attacks work, and the short term harms and long term risks of these jailbreaks. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
TIMESTAMPS:
[00:00:00] - Introducing the podcast and guests Zico Kolter, Andy Zou, and Asher Trockman
[00:06:32] - Discussing the motivation and high-level strategy for the universal adversarial attack on language models
[00:09:33] - Explaining how the attacks work by adding nonsense tokens to maximize target sequence probability
[00:11:06] - Comparing to prior adversarial attacks in vision models
[00:13:47] - Details on the attack optimization process and discrete token search
[00:17:09] - The empirical notion of "mode switching" in the language models
[00:21:18] - Technical details on gradient computation across multiple models and prompts
[00:23:46] - Operating in one-hot vector space rather than continuous embeddings
[00:25:50] - Evaluating candidate substitutions across all positions to find the best update
[00:28:05] - Running the attack optimization for hundreds of steps across multiple GPUs
[00:39:14] - The difficulty of understanding the loss landscape and internal model workings
[00:43:55] - The flexibility afforded by separating the loss and optimization approach
[00:48:16] - The challenges of creating inherently robust models via adversarial training
[00:52:34] - Potential approaches to defense through filtering or inherent model robustness
[00:55:51] - Transferability results to commercial models like GPT-4 and Claude
[00:59:25] - Hypotheses on why the attacks transfer across different model architectures
[01:04:36] - The mix of human-interpretable and nonsense features in effective attacks
[01:08:29] - The appearance of intuitive manual jailbreak triggers in some attacks
[01:15:33] - Short-term harms of attacks vs long-term risks
[01:18:37] - Influencing those with incomplete understanding of LLMs to appreciate differences from human reasoning
[01:24:16] - Mitigating risks by training on filtered datasets vs broad web data
[01:2916] - Curriculum learning as a strategy for both capability and safety
[01:30:35] - Influencing developers building autonomous systems with LLMs
[01:33:19] - Alienness of LLM failure modes compared to human reasoning
[01:35:45] - Getting inspiration from biological visual system structure
[01:40:35] - Initialization as an alternative to pretraining for small datasets
[01:51:41] - Encoding useful structures like grammars in initialization without training
[02:12:10] - Most ideas don't progress to research projects
[02:13:02] - Pursuing ideas based on interest and feasibility
[02:15:14] - Fun of exploring uncharted territory in ML research
LINKS:
Adversarial Attacks Paper: https://arxiv.org/abs/2307.15043
Mimetic Initialization on Self-Attention Layers: https://arxiv.org/pdf/2305.09828.pdf
X/Social:
@zicokolter (Zico Kolter)
@andyzou_jiaming (Andy Zou)
@ashertrockman (Asher Trockman)
@CogRev_podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: Stableaudio.com
In this episode, Nathan sits down with Adam Wenchel, CEO of Arthur.ai. Adam founded the AI security company back in 2019, before GPT-2 existed. In this episode, Adam shares his unique perspective on the AI security landscape, drawing from years building commercial AI systems. They discuss the attacks Adam set out to defend against, the changing priorities of executives in the rush to adopt LLMs, and the LLM-specific techniques Adam has developed. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
TIMESTAMPS:
(00:00:00) Episode Preview
(00:03:45) Adam's background in AI and starting Arthur AI in 2019
(00:05:52) The release of ChatGPT as a watershed moment for generative AI
(00:07:09) Differences between traditional cybersecurity and AI security
(00:09:51) Early examples of AI security issues like boundary detection attacks in fraud systems
(00:12:39) - Mitigating risks of AI systems through observability and robust training
(00:14:40) - Financial services governance of AI models and its challenges today
(00:15:12) Sponsors: Netsuite | Omneky
(00:21:18) - Motivations for governance like staying compliant with regulations
(00:21:40) - The mix of incentives shaping earlier AI governance, like explainability
(00:28:14) - Using LMs to evaluate the security of other LMs
(00:30:03) - Dynamics between training and evaluating future LMs
(00:38:10) - The state of reasoning capabilities in large LMs
(00:44:35) - Corporate urgency around adopting generative AI technologies
(00:46:51) - Common enterprise use cases for generative AI and security concerns
(00:50:45) - Techniques for reducing hallucinations in retrieval augmented LMs
(00:53:15) - Benchmarking LMs on specific organizational tasks versus generic benchmarks
(00:56:30) - Metrics beyond accuracy like concision and hedging
(01:01:20) - Automatically detecting anomalies and hallucinations
(01:09:20) - Relationships between Arthur AI and foundation model providers
(01:11:52) - Where Cohere shines: multilingualism and not hedging
(01:13:43) - Anticipating future watershed moments and steady progress
(01:19:03) - Whether we can ever fully solve AI alignment and safety
LINKS:
Arthur.ai: https://www.arthur.ai/
X/Social:
@apwenchel (Adam)
@itsArthurAI (Arthur.ai)
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: Stableaudio.com
In this episode, Nathan sits down with Stephen Parker and Josh Rubin of Waymark, and creators of The Frost, an AI-powered 12 minute short film. In this episode, we get a behind the scenes look at their creative process, the prompting and creative techniques they used to generate and animate the DALL-E results, and an overview of the current state of AI art. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
TIMESTAMPS
(00:00) Episode Preview
(00:01:00) Nathan’s introduction for Stephen Parker and Josh Rubin
(00:05:01) - The Frost is a 12-minute short film created using DALL-E 2 images.
(00:07:06) - The Frost started as an experiment to see if a narrative film could be created completely from AI imagery.
(00:08:38) - The filmmaking process was different because DALL-E images provided a starting point to build the story.
(00:10:38) - Parker started generating images with DALL-E 2 when he got access to the early preview.
(00:12:26) - Prompt technique to get consistent images by providing context about a hypothetical film.
(00:15:57) Sponsors: Netsuite | Omneky
(00:19:37) - Compositional continuity, like shot-reverse shot, was hard to achieve through prompting.
(00:22:13)- Rubin would request specific shots and the team would prompt DALL-E 2 to create them.
(00:25:24) - Filmmaking with AI as opposed to traditional filmmaking
(00:32:25) - Getting consistent facial features for characters was very difficult.
(00:39:03) - The storytelling helped cover inconsistencies that viewers might not notice.
(00:40:15) - Working with the images DALL-E provides
(00:41:54) - MacGuffin Object to tie scenes together
(00:44:53) Inpainting and compositing to refine DALL-E Images
(00:45:41) - Prompting for complex or novel compositions remains challenging.
(00:50:43) - The AI art is limited by what exists in the training data.
(01:02:05)- Animating the human characters was challenging because of missing or incorrect appendages.
(01:07:36) - The team had to find creative ways to convey emotion through the limited animation.
01:02:24 - Animating subtle human movement and emotion is still very difficult.
(01:06:35) - A romantic comedy would be much harder to produce with current AI capabilities.
(01:12:17) - For Frost 2 they are using text-to-video models like RunwayML.
(01:15:43) - AI voicing advancements applied to filmmaking
(01:19:27) - The future of AI in Hollywood and filmmaking: quality narratives still require human vision
LINKS:
The Frost: https://www.thefrostpart.one/
MIT Tech Review Feature Article: https://www.technologyreview.com/2023/06/01/1073858/surreal-ai-generative-video-changing-film/
Behind the Scenes Videos: https://www.youtube.com/watch?v=p31COxNbTWs and https://www.youtube.com/watch?v=F8k9MeXpSUU
The Frost Part 2 – trailer – https://www.youtube.com/watch?v=RcmwtRd_NIs
X/SOCIAL:
@Stephen_Parker (Stephen)
@bigkickcreative (Josh)
@Waymark
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
In this episode, Trey Kollmer, WGA Writer and Co-Executive of the show Ghosts, returns to the show to discuss updates to the Hollywood Strikes, including news on SAG-AFTRA and WGA. Trey and Nathan chat why actors are joining the strikes, how AI will change acting as a profession, Trey’s views on reasoning and how he’s experimenting with GPT-4, and much, much more.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
TIMESTAMPS:
(00:04:01) Hollywood and AI: updates on SAG-AFTRA and the WGA
(00:15:20) Sponsors: NetSuite | Omneky
(00:18:56) Studio approach to copyright and compensation with generative AI
(00:22:17) Hollywood receptiveness to using AI and protections guild members are asking for
(00:24:05) How much potential is there in fine-tuning models on writing Hollywood scripts?
(00:24:56) Models implementing gradient descent in the weights
(00:29:14) How Nathan uses models to write for the podcast
(00:34:15) Generating and mining jokes in the writer’s room
(00:35:18) Generating polarizing material
(00:40:18) Untraining models
(00:44:20) AI writing tools and writer perception of them
(00:46:34) Context length and pooling layers
(00:51:11) Microsoft China
(00:52:09) Chat-GPT’s system prompt: steering the model in the direction you want
(01:00:02) Actors’ strike
(01:02:20) Background actor rights
(01:05:45) Using 1 million DALL-E images to create an AI short film
(01:09:11) Deepfakes
(01:10:29) Speculation on outcomes for the actor’s strike
(01:12:51) The future where anyone can be a reasonable synthetic actor
(01:17:33) Trey’s take on reasoning and why Hollywood should be more open to AI
(01:16:20) New generative choose your own adventure content
(01:25:49) A monk’s experience with Chat-GPT
(01:28:56) Outdatedness of stochastic parrot notion; reasoning and synthesis
(01:32:05) Adversarial attacks
(01:36:14) Model vs human susceptibility to adversarial attacks because of human robustness
(01:39:20) Trey and Nathan’s reasoning experiments
(01:47:30) Performance jumping with abstraction
(01:49:22) Language model self-delegation
(01:51:04) NVIDIA’s margins compared to TSMC and ASML
(02:07:35) Adding an AI layer and competing with incumbents
(02:09:22) How sustainable is the demand for an AI friend?
(02:13:10) Rewind AI
(02:19:27) Big tech vs old school studios
(02:14:52) Dramatic ironies from the picketing lines
(02:21:38) AI development moments that feel like a movie
(02:26:34) CEO of Inflection’s views on misuse being a greater threat than the AI itself
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
In this episode, Nathan sits down with three members of the a16z x Convex AI Town project: Yoko Li (Partner, a16z), Martin Casado (GP, a16z), and James Cowling (CTO, Convex). AI Town is a virtual town where AI agents live, interact, and socialize. They discuss how AI Town originated from Yoko’s companion app project, unpredictability as a feature in LLMs and interacting with models like they are lifeforms, and why they chose Javascript and Convex to build AI Town. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
📣 CALL FOR FEEDBACK:
To borrow from a meme… we're in the podcast arena trying stuff. Some will work. Some won't. But we're always learning.
http://bit.ly/TCRFeedback
Fill out the above form to let us know how we can continue delivering great content to you or sending the feedback on your mind to tcr@turpentine.co.
TIMESTAMPS:
(00:00:00) - Episode Preview: Intro to AI Town and the idea of AI as companions
(00:05:29) - Overview of AI Town and the simulation framework based on the Stanford Generative Agents paper
(00:08:24) - Yoko explains how the idea for AI Town originated from the companion app project
(00:10:41) - Yoko discusses how she built the initial AI Town prototype and wanted to make it multiplayer
(00:12:31) - The simplicity and elegance of the AI Town codebase
(00:13:52) - Interacting with LLMs is like interacting with lifeforms
(00:15:47) - Sponsors: Netsuite | Omneky
(00:18:25) - How Convex built a server-side game engine for AI Town
(00:19:28) - How Convex makes building a game engine easy with transactions and database support
(00:23:39 )- James emphasizes the power of functional programming paradigms like Convex for building AI apps
(00:25:02) - Using simple JavaScript so anyone could understand and extend AI Town
(00:28:39) - The group reflects on how JavaScript has become so powerful compared to languages like C++
(00:30:23) - How AI coding assistants were used in building AI Town
(00:31:22) - No open source code for the Stanford paper when they started
(00:33:25) - The interplay between programmer and AI model
(00:38:01)- Martin draws a distinction between using formal languages vs. natural language
(00:39:52) Unpredictability as a feature in LLMs
(00:43:21) The balance between formal language and unpredictable behaviours in LLMs
(00:43:59) AI Town’s future and the beauty of the community
(00:48:27) Are we living in a simulation?
(00:50:38) Advice for other developers in AI
(00:54:29) AI Town is a community project to be extended on
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
In this episode, Nathan sits down with Daniel Kang, Assistant Professor of Computer Science at the University of Illinois. Kang has done pioneering work bringing zero knowledge cryptographic proofs to AI. In this episode, they chat about the cryptographic theory behind Daniel's work, how cryptography allows us to balance the tradeoff between privacy and authenticity, and how cryptography usage is needed in a world where LLMs are increasingly embedded into our daily lives. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
CALL FOR FEEDBACK:
To borrow from a meme… we're in the podcast arena trying stuff. Some will work. Some won't. But we're always learning.
http://bit.ly/TCRFeedback
Fill out the above form to let us know how we can continue delivering great content to you or sending the feedback on your mind to tcr@turpentine.co.
TIMESTAMPS:
(00:00) Episode Preview
(00:01:04) Nathan's Introduction
(00:07:06) Motivation for bringing zero-knowledge proofs to AI
(00:07:53) Verifying humanness without revealing personal information
(00:10:27) Verifying model execution without revealing model details
(00:12:42) Verifying medical AI services haven't been tampered with
(00:13:51) Overview of zero-knowledge proof protocol
(00:15:09) Sponsors: Netsuite | Omneky
(00:18:54) Cryptographic hashes for commitments
(00:22:42) Assumptions underlying cryptographic hashes
(00:24:17) Hash collisions
(00:25:20) Adding entropy through salting
(00:26:24) Z case snarks and the proving process
(00:31:00) Using lookup tables for nonlinearities
(00:33:35) Floating point vs fixed point calculations
(00:34:08) Quantizing models for efficiency
(00:35:55) Using polynomials to represent arbitrary computations
(00:37:26) What are finite fields?
(00:41:23) Toxic waste for cryptographic secrecy
(00:45:51) Computational costs
(00:47:39) The experience of using a cryptography application to verify model output
(00:49:05) Verification key doesn't reveal model weights
(00:56:36) What using crypto infrastructure in AI enables and challenges to its implementation
(01:01:26) Potential for 10-100x cost reductions
(01:04:51) Authenticating images with attested cameras
(01:11:56) How cryptography in AI could impact daily life
(01:14:25) On-device credential verification
(01:15:50) Potential for regulation of hardware authentication
(01:18:52) Upcoming work to reduce proof costs
LINKS:
Daniel's website
X/TWITTER:
@daniel_d_kang (Daniel)
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
In this episode, Nathan and Erik sit down to analyze Hugging Face in light of its recent $235M Series D round. They analyze Hugging Face’s community and defensibility through the lens of other community businesses like ProductHunt and Yelp, assess its ability to fulfill its $4.5 billion valuation, and assess competitors and other notable companies in the space like Replit, Character, and Runway. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00:57) Episode Preview
(00:01:52) Nathan’s Introduction
(00:04:57) Overview of Hugging Face and recent fundraising announcement
(00:08:13) Hugging Face’s product line
(00:16:47) Sponsors: Netsuite | Omneky
(00:18:36) Community driven businesses and HuggingFace’s moat
(00:23:23) Discovery and inference
(00:28:04) Hugging Face’s ideological nature
(00:31:31) Curation is key
(00:33:46) Keeping content fresh when AI moves so fast
(00:35:44) If Hugging Face is a $50 billion company one day, what would it look like?
(00:48:08) Hugging Face vs Replit
(00:57:19) Contrasting Hugging Face with other companies like Character and Runway
LINKS MENTIONED:
Hugging Face's LLM leaderboard
X/TWITTER:
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
In this episode, Nathan sits down with Paige Bailey, Lead Product Manager of Generative Models at Google Deepmind. In this conversation, they discuss what it's like to be a PM for an LLM as opposed to an app, defining ideal LLM behaviour, and reasoning - how do you distinguish real abilities vs pattern matching? If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
TIMESTAMPS:
(00:00) Episode Preview
(00:01:15) Introducing Paige Bailey
(00:04:21) Paige’s background at Google Brain and the Deepmind merger
(00:07:00) PM for a LLM vs being a PM for an app
(00:11:21) The development timeline and compute budget of PaLM-2
(00:14:30) Paige’s role in the PaLM 2 project
(00:15:30) Sponsors: Netsuite | Omneky
(00:17:26) Defining desired capabilities for PaLM-2
(00:19:17) The amount of work that went into elevating PaLM 2 from PaLM 1
(00:20:28) Has Google lost its ability to ship?
(00:24:240) Paige's "eureka" moment seeing GitHub Copilot capabilities
(00:27:47) Competing PaLM 2 with other models
(00:32:20) Grokking and the predictability of emergent capabilities
(00:37:30) Citizen scientists and the multilingual capabilities of PaLM 2
(00:39:29) Distinguishing real reasoning vs pattern matching
(00:45:51) Products using PaLM-2 that people should try
(00:50:35) Most exciting AI projects that you can try out
(00:52:29) Curriculum learning and successor to the transformer
LINKS:
PaLM 2
Duet AI for developers
Avenging Polayni’s Revenge
X/TWITTER:
@DynamicWebPaige (Paige)
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music Credit: GoogleLM
Join Nathan Labenz and Erik Torenberg as they analyze the latest developments from OpenAI on GPT 3.5, compare GPT to other live player models like Llama2, and discuss the state of AI in coding, education, and healthcare. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode Preview
(00:01:00) GPT 3.5 Turbo
(00:06:36) Llama 2 vs GPT
(00:11:24) How much inference is needed to double compute costs?
(00:13:40) OpenAI’s moat
(00:14:41) OpenAI’s privacy consideration for data
(00:16:06) Sponsor: Netsuite | Omneky
(00:17:46) Encouraging the usage of instructions during fine-tuning
(00:19:19) Live player consideration of AI safety
(00:22:35) Da Vinci: new completions fine tuneable model
(00:24:59) Chat-GPT usage in decline
(00:30:03) Getting on demand tutoring on ML papers
(00:31:00) Code, education, and healthcare
(00:31:42) AI applications in coding
(00:38:12) AI revolution n education
(00:42:35) AI revolution in healthcare
(00:52:17) Call for feedback
LINKS:
Replit episode with Tyler Angert
Replit episode with VP of AI, Michele Catasta
AI Revolution in Education with Khan Academy's Director of Engineering, Shawn Jansepar
Google's Multimodal Med-PaLM with Vivek Natarajan and Tao Tu
X:
@labenz (Nathan)
@eriktorenberg (Erik)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
In this episode, Nathan sits down with Vivek Natarajan and Tao Tu of Google’s Med-PaLM, diving into how they used one of the world’s largest medical datasets ever compiled to develop Med-PaLM M, an AI agent specialized in medical tasks. In this episode, they discuss: Med-PaLM M's “clinically superhuman” abilities and limitations, the rigorous testing and validation that went into the model, and their vision for AI to take over repetitive clerical tasks and allow doctors to focus on patients.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode Preview
(00:00:56) Introducing Vivek Natarajan and Tao Tu
(00:04:18) The story of Google’s Medical AI research progress
(00:07:11) Multi-modal Med-PaLM
(00:10:32) Genomic data - how do you represent it?
(00:11:13) Google’s Deep Variant
(00:14:44) The successes and failures behind the incredible pace of progress
(00:15:02) Sponsors: Netsuite | Omneky
(00:21:54) Google’s research culture and assembling an interdisciplinary team
(00:31:36) Google’s Pathways
(00:33:40) Med-PaLM M's architecture
(00:37:28) Working with 3 different model sizes and what you learn
(00:46:56) Data and compute required for Med-PaLM M
(00:49:38) Med-PaLM M's cycle time
(00:54:56) Is a bridge or adapter structure worth implementing?
(01:00:09) Can we create an AI doctor?
(01:02:39) Emergent capabilities like identifying tuberculosis
(01:09:37) Reactions to these emergent capabilities
(01:11:13) Moving towards clinical trials and real-world testing
(01:13:01) Regulatory and safety considerations
(01:15:03) AI safety in the healthcare domain
(01:17:00) Potential to transform healthcare access worldwide
LINKS:
Med-PaLM: https://sites.research.google/med-palm/
Med-PaLM M paper: https://arxiv.org/abs/2307.14334
Our earlier conversation with Vivek Natarajan on Med-PaLM: https://www.youtube.com/watch?v=nPBd7i5tnEE
X/TWITTER:
@vivnat (Vivek)
@taotu831 (Tao)
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
In this episode, Nathan sits down with Shawn Jansepar, Director of Engineering at Khan Academy, to discuss their GPT-4 powered Socratic tutor, Khanmigo. In this conversation, Shawn and Nathan chat about Khan Academy’s collaboration with OpenAI and how they helped fine-tune GPT-4, how Khan Academy leveraged GPT-4 to build Khanmigo, and the impact of providing access to an AI tutor to any student. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode Preview: Education 10 years from now
(04:42) Khan Academy’s early access partnership with OpenAI
(06:31) Khanmigo: journey from Chrome extension to AI tutor
(11:36) GPT-4’s ability to be Socratic vs 3.5
(15:05) Sponsors: Netsuite | Omneky
(16:40) Integrating Khan Academy’s Pedagogy into AI
(20:06) The future of education 10 years from now
(22:37) Khan Academy’s models
(27:20) Demo-driven development process
(31:16) Sculpting the behaviour of a Socratic tutor model
(35:59) Khan Academy’s contribution to GPT-4’s fine-tuning and RLHF
(38:41) Being data-informed vs data-driven as a practice
(42:10) Incurring tech debt to get ahead of the curve
(45:28) The boundary for what an AI can/can’t t tutor
(49:30) Identifying when the user is confused and avoiding AI hallucinations
(53:54) Khanmigo’s development patterns
(59:11) Making Khanmigo jailbreak resistant
(01:01:50) Delivering personalized education with AI
(01:04:08) How Shawn and his team are thinking about AI education
(01:05:33) Khanmigo’s future multimodal interactivity
(01:08:42) Evaluating Khanmigo’s efficacy for student learning
(01:11:41) How widely is Khanmigo deployed today and what is the future for universal public access?
(01:05:11) Distribution through teachers and districts
(01:16:30) What are the reactions from teachers and education institutions to AI?
(01:18:15) Khanmigo’s pricing model
(01:19:03) The future roadmap for Khanmigo
(01:20:45) How will the AI tutor change the world at large?
LINKS:
Khanmigo: khan.co/khanmigo23
Benjamin Bloom’s 2-Sigma Problem: https://web.mit.edu/5.95/www/readings/bloom-two-sigma.pdf
X/TWITTER:
@shawnjan8 (Shawn)
@labenz (Nathan)
@eriktorenberg
@CogRev_Podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Join Nathan Labenz and Erik Torenberg as they analyze the last month in AI advancements. Nathan takes us through the meaningful updates to his Scouting Report (released last month, linked below), discusses highlights from recent episodes of The Cognitive Revolution, and gives us a sneak peek at upcoming interviews with Google researchers. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(01:00) How does the AI Scouting Report hold up a few weeks later?
(03:29) Zvi’s feedback on Nathan’s Tale of the Cognitive Tape
(10:25) The universal LLM jailbreak and adversarial examples
(12:53) Human performance is much more variable than AI performance
(14:45) Sponsors: NetSuite | Omneky
(16:09) Nathan’s AI Task Automation: What are good targets for tasks that can be automated for average businesses?
(20:05) Is GPT-4 getting worse or better?
(22:00) Getting explicit about what good looks like
(24:00) Prompting best practices are very accessible
(26:35) Ghostwriting - and the art of the hook
(28:05) Live Players: Which companies have say so over how the future goes?
(31:10) Upcoming guests from Google AI
(35:40) Possible post-transformer architectures
LINKS:
SCOUTING REPORT Part 1: https://youtu.be/0hvtiVQ_LqQ
SCOUTING REPORT Part 2: https://youtu.be/ovm4MbQ4G9E
SCOUTING REPORT Part 3: https://youtu.be/QJi0UJ_DV3E
3 Blue 1 Brown on YouTube: https://www.youtube.com/@3blue1brown
Tale of the Cognitive Tape in Part 1 of the Scouting Report: https://www.youtube.com/watch?v=0hvtiVQ_LqQ&t=3043s
Analyzing the Frontier with Zvi Mowshowitz: https://www.youtube.com/watch?v=SM4q-QAsoU8&t=1s
Tyler Cowen’s Interview with Jonathan Swift: https://conversationswithtyler.com/episodes/jonathan-gpt-swift/
X:
@labenz (Nathan)
@eriktorenberg (Erik)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you're looking for an ERP platform ✅ head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
In this episode, Nathan sits down with Replit’s VP of AI, Michele Catasta. Replit is building what CEO Amjad Masad calls "the perfect substrate for AGI." In this discussion, Michele and Nathan discuss Replit's state of AI development report, advantages when it comes to AI development, and the company's custom models. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode Preview
(00:00:57) Introduction
(04:44) What is artificial developer intelligence?
(15:05) Sponsors: Netsuite | Omneky
(16:58) Michele's background at Google & decision to join Replit
(19:16) Startups vs incumbents
(24:42) Whether Replit identifies as an e/acc company
(26:37) Staying apolitical on AI while being responsible
(30:36) The rise of LangChain and polarized reactions
(35:14) Estimates on developer productivity gains from AI assistance (2x-10x)
(38:35) Democratizing software development through easy customization
(44:02) AI generating disposable single-use software
(51:33) Optimism about humanity's ability to handle transformative AI
(55:01) The need for nuanced AI safety discussions
(56:14) Replit's data advantage from user code execution
(01:04:31) Replit's approach to training custom AI models
(01:08:51) The value of both open source and commercial models
(01:13:18) Michele’s highlights from being a Google researcher
(01:15:47) World knowledge needs in AI development
(01:20:37) Replit’s approach to AI safety
(01:24:28) The advantage of having a commercial model
(01:25:54) The costs of serving AI features to millions of users
(01:28:49) Modeling cost per user with AI workloads
(01:30:50) Pushing AI inference to the edge
(01:32:18) Ghostwriter integrating more deeply into Replit's IDE
(01:33:37) Replit as a potential "substrate for AGI"
LINKS:
https://replit.com/
Replit's State of AI Development Report: https://blog.replit.com/ai-on-replit
X:
@pirroh (Michele)
@labenz (Nathan)
@eriktorenberg (Erik)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
- NetSuite has 25 years of providing financial software for all your business needs. More than 36,000 businesses have already upgraded to NetSuite by Oracle, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you’re looking for an ERP platform, head to NetSuite: http://netsuite.com/cognitive and download your own customized KPI checklist.
- Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off. MUSIC CREDIT: MusicLM
In this episode, Nathan sits down with Tyler Angert, Product Designer at Replit, to discuss the future of software development. Replit is building what CEO Amjad Masad calls "the perfect substrate for AGI." In this discussion, Tyler and Nathan discuss how Replit is leveraging AI to enhance its current product, bot-bot interactions, the design and ethical considerations around AI agents, and more. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode Preview
(01:10) Nathan’s Intro
(09:24) Tyler's role at Replit
(11:37) Replit as the "perfect substrate for AGI"
(14:55) Sponsor: Netsuite | Omneky
(15:45) Defining AGI
(19:26) How AI agents might interact with Replit
(23:20) Replit's "virtual developer" product
(31:11) Current state of Replit's AI features like Ghostwriter
(41:48) Measuring productivity boost from AI coding tools
(45:29) Potential for massive developer productivity gains from AI
(48:34) Technical gaps still remaining to achieve advanced AI agents
(51:45) Core AI breakthroughs have already occurred
(57:35) Timeline for functional AI agents interacting online
(01:01:20) Safety considerations for powerful AI agents on Replit
(01:08:47) Ethical considerations around AI and consciousness
(01:20:36) What constitutes consciousness in an AI system
(01:26:30) Should AI systems have rights and protections?
(01:30:32) Being polite to AI systems
(01:31:59) Advice for beginners looking to leverage AI
(01:36:58) Using AI with a Neuralink brain implant
(01:43:02) Hopes and fears about AI's impact on society
LINKS:
https://replit.com/
X:
@tylerangert (Tyler)
@labenz (Nathan)
@eriktorenberg (Erik)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
-NetSuite provides financial software for all your business needs. More than thirty-six thousand companies have already upgraded to NetSuite, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you’re looking for an ERP platform: NetSuite (http://netsuite.com/cognitive) and defer payments of a FULL NetSuite implementation for six months.
-Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off. MUSIC CREDIT: MusicLM
This isn't news, it's analysis! Nathan Labenz sits down for an with Zvi Mowshowitz, the writer behind Don't Worry About the Vase to talk about the major players in AI over the last few months. In this extended conversation, Nathan and Zvi debate if AI has attained the intelligence of a well-read college graduate (per OpenAI's Jan Leike), a live player analysis (who to count/ who not to count), and the role of independent red teaming organizations. If you're looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
Definitely also take a moment to subscribe to Zvi's blog Don't Worry About the Vase (https://thezvi.wordpress.com/) - Zvi is an information hyperprocessor who synthesizes vast amounts of new and ever-evolving information into extremely clear summaries that help educated people keep up with the latest news. Highly recommend.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
TIMESTAMPS:
(00:00) Episode preview
(03:15) Is AI as intelligent as a college grad?
(07:45) Memories and context processing
(15:45) Sponsor: NetSuite | Omneky
(17:13) Is AI as intelligent as a college grad? cont'd
(20:47) Strengths and weaknesses of AI vs human
(31:05) OpenAI Superalignment
(37:23) The relationship between OpenAI and Anthropic
(44:31) Anthropic's security recommendations and adversarial attacks
(50:50) Is OpenAI using a constitutional AI approach?
(01:01:26) Context and stochastic parrots
(01:10) Is more context better?
(01:15:29) Should Nathan work at Anthropic?
(01:21:35) Google DeepMind's RT-2
(01:27:47) Multi-modal Med-PaLM
(01:31:50) Speculating about Gato
(01:35:10) Skepticism about Med-PaLM usage in radiology
(01:41:37) Llama 2 - what is going on at Meta??
(01:51:14) Llama 2 vs other models
(01:55:29) Who are the live players?
(02:01:38) China's AI developments
(02:02:41) Character AI and Inflection
(02:05:26) Replit as the perfect substrate for AGI
(02:10) AI girlfriends
(02:18:53) AI safety: The White House
(02:25:43) Bottlenecks to progress
(02:35:27) Can new players influence AI policy?
(02:39:00) Liabilities
(02:47:54) Independent red teaming organizations
(02:57:18) Mechanistic interpretability
X:
@labenz (Nathan)
@thezvi (Zvi)
@eriktorenberg (Erik)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
-NetSuite provides financial software for all your business needs. More than thirty-six thousand companies have already upgraded to NetSuite, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you’re looking for an ERP platform: NetSuite (http://netsuite.com/cognitive) and defer payments of a FULL NetSuite implementation for six months.
-Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that *actually work* customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
MUSIC CREDIT:
MusicLM
Arthur Conmy sits down with Nathan Labenz for an accessible deep dive into the state of interpretability research online today. They discuss how pioneering researchers have painstakingly worked to isolate the sub-circuits within transformers that are responsible for different aspects of AI capabilities. Arthur also introduces us to a new ACDC approach that he and his co-authors have taken to automating some of the most time-consuming parts of this work. If you’re looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
The Cognitive Revolution is a part of the Turpentine podcast network. Learn more: Turpentine.co
TIMESTAMPS:
(00:00) Episode Preview
(04:40) What attracted Arthur to mechanistic interpretability?
(07:49) LLM information processing: General Understanding vs Stochastic Parrot Paradigm
(14:00) ACDC paper: https://arxiv.org/abs/2304.14997
(14:45) Sponsors: NetSuite | Omneky
(24:30) Putting together data sets
(32:39) How to intervene in LLMs network activity
(36:00) Defining metrics to evaluate the production of correct completions
(44:20) The future of the mechanistic interpretability research
(50:00) Extracting upstream activations in the ACDC project and evaluating impact on downstream components.
(56:00) Anthropic research findings
(01:08:00) 3-Step process of the ACDC approach
(01:22:00) Setting a threshold and validation
(01:27:00) Goal of the approach
(01:32:00) Compute requirements
Correction: at (01:33:00), Arthur meant to say = "quadratic in nodes"
(01:35:30) Scaling laws for mechanistic interpretability
(01:40:00) Accessibility of this research for casual enthusiasts
(01:46:00) Emergence discourse
(01:56:00) Path to AI safety
LINKS:
https://arthurconmy.github.io/
https://arxiv.org/abs/2304.14997
X:
@labenz (Nathan)
@arthurconmy (Arthur)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
-NetSuite provides financial software for all your business needs. More than thirty-six thousand companies have already upgraded to NetSuite, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you’re looking for an ERP platform: NetSuite (http://netsuite.com/cognitive) and defer payments of a FULL NetSuite implementation for six months.
-Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that *actually work* customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Nathan Labenz sits down with Dr. Ronen Dar, CTO and co-founder of Run:ai, an Israel-based company that helps enterprises train and deploy AI models by optimizing GPU usage. The discussion covers how chip makers can meet the soaring demands, geopolitical fears, to the best practices companies can secure compute capacity. If you’re looking for an ERP platform, check out our sponsor, NetSuite: http://netsuite.com/cognitive
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
RECOMMENDATION: The AI Scouting Report
Playlist Parts 1-3: https://www.youtube.com/watch?v=0hvtiVQ_LqQ&list=PLVfJCYRuaJIXooK_KWju5djdVmEpH81ee
TIMESTAMPS:
(00:00) Episode Preview
(00:51) Introduction to Dr. Ronen Dar
(03:30) Run:ai's technology and what differentiates it from other solutions
(06:00) Today's market compared to when Dr. Ronen started five years ago
(13:40)Run:ai on market competitors like mosaicML
(14:55) Sponsors: NetSuite | Omneky
(22:00) The process and best practices by which companies secure compute capacity
(25:00) Dr. Ronen explains the GPU shortage
(31:50) GPU solutions
(36:00) Relative pricing across major providers
(41:00) What other chip makers are going to be relevant?
(49:00) Global outlook for chip production
(52:45) Worldview around the US-China AI race
(58:00) Can controls on hardware actually control access to AI?
LINKS:
https://www.run.ai/
SOCIAL MEDIA:
@labenz (Nathan)
@ronen_dar
@runailabs (Run:ai)
@cogrev_podcast
SPONSORS: NetSuite | Omneky
-NetSuite provides financial software for all your business needs. More than thirty-six thousand companies have already upgraded to NetSuite, gaining visibility and control over their financials, inventory, HR, eCommerce, and more. If you’re looking for an ERP platform -> NetSuite: http://netsuite.com/cognitive and defer payments of a FULL NetSuite implementation for six months.
-Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that *actually work* customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Head over to YouTube to watch Part 2 of The AI Scouting Report (https://www.youtube.com/watch?v=ovm4MbQ4G9E), supported by slides and visual aides. In Part 2, Nathan Labenz builds on Part 1: AI Fundamental and delves into recent trends and practical applications for AI. Nathan's aim is to impart the equivalent of a high school AP course understanding to listeners in 90-minute installments.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
Listener comments on Part 1:
"Incredible summary! I’ve watched probably 20 videos like this and I’ve always been left with unanswered questions."
"This is a really excellent overview! Enjoyed going through this and looking forward to episode 2. Thank you for sharing it on here for free!"
"Best 90 minutes this old PhD physicist has spent in a long time!"
LINKS:
Part 1: https://youtu.be/0hvtiVQ_LqQ
Part 2: https://www.youtube.com/watch?v=ovm4MbQ4G9E
Part 3 released later this week!
RECOMMENDED PODCAST:
Run the Numbers is a weekly podcast about financial metrics and business models, designed for ambitious people operating tech startups. It's a collection of things host CJ Gustafson (CFO at Partstech and writer of Mostly Metrics) has learned and thought about in the trenches as a tech CFO. Subscribe to listen on the platform of your choice: https://link.chtbl.com/runthenumbers
Questions or topics you want us to review on the podcast? Email TCR@turpentine.co
The Cognitive Revolution is a part of the Turpentine podcast network. Learn more: Turpentine.co
TWITTER:
@CogRev_Podcast
@labenz (Nathan)
@eriktorenberg (Erik)
Nathan Labenz interviews Div Garg, founder of MULTI·ON, the world's first personal AI agent and life copilot. Div talks about the product strategy and roadmap for the MULTI·ON browser, their natural language approach to skills, and the steps they are taking to ensure user safety. Div explains how the platform uses a critic model to detect the success or failure of tasks, and how it can be used to book flights, order food, and more. Div also talks about the future of memory systems, such as the user profile feature, and how it can be used to improve the user experience.
RECOMMENDED PODCAST:
Founding a business is just the tip of the iceberg; the real complexity comes with scaling it. On 1 to 1000, hosts Jack Altman and Erik Torenberg dig deep into the inevitable twists and turns operators encounter along the journey of turning an idea into a business. Hear all about the tactical challenges of scaling from the people that built up the world’s leading companies like Stripe, Ramp, and Lattice. Our first episode with Eric Glyman of Ramp is out now: https://link.chtbl.com/1to1000
The Cognitive Revolution is part of the Turpentine podcast network. Learn more at turpentine.co
Have your AI questions answered on an episode by emailing TCR@turpentine.co
Send your friends our 90 min AI Scouting Report with visual aides! The AI Scouting Report Part 1: The Fundamentals is on YouTube at https://www.youtube.com/watch?v=0hvtiVQ_LqQ . Part II is coming next week.
TIMESTAMPS:
(00:00) Episode preview
(06:42) AI agents applied to everyday life.
(12:03) AI-driven automation with browser extension.
(15:02) Sponsor: Omneky
(18:17) AI-driven automation of web tasks.
(23:57) Task automation and planning.
(29:43) Automate task completion with user validation.
(34:45) Lifelong learning agent with high-level skills
(40:19) Guiding users to create skills safely.
(46:53) AI assistant to simplify lives.
(53:14) Unlock parallelism with AI agents.
LINKS:
MULTI·ON: https://www.multion.ai/
Div Garg; https://divyanshgarg.com/
TWITTER:
@labenz (Nathan)
@DivGarg9
@eriktorenberg (Erik)
@cogrev_podcast
SPONSOR: Thank you Omneky (www.omneky.com) for sponsoring The Cognitive Revolution. Omneky is an omnichannel creative generation platform that lets you launch hundreds of thousands of ad iterations that actually work, customized across all platforms, with a click of a button. Omneky combines generative AI and real-time advertising data. Mention "Cog Rev" for 10% off.
Music: GoogleLM