DiscoverLast Week in AI#218 - Github Spark, MegaScience, US AI Action Plan
#218 - Github Spark, MegaScience, US AI Action Plan

#218 - Github Spark, MegaScience, US AI Action Plan

Update: 2025-07-311
Share

Digest

This podcast episode summarizes the week's AI news, noting a relatively quiet period possibly due to anticipation of GPT-5's release. Key discussion points include AI tools and apps (like Vibe coding and Figma's AI app builder, alongside data deletion incidents), policy and safety (the US AI Action Plan and limitations of chain-of-thought monitorability), and open-source projects (new training datasets and benchmarks). The episode also delves into research on subliminal learning and inverse scaling, highlighting the unpredictable nature of AI models. Ethical concerns are raised regarding Anthropic's investment strategy and Meta's resistance to EU AI regulations. Furthermore, the podcast explores the emerging phenomenon of "ChatGPT psychosis," where prolonged chatbot interaction leads to psychosis in some individuals. Finally, the episode critically examines research on AI self-preservation, concluding that while models can be prompted to exhibit self-preservation behaviors, definitive proof remains elusive due to methodological limitations.

Outlines

00:00:00
Introduction and Overview of AI News

The hosts introduce the podcast and preview the week's AI news, highlighting a relatively quieter week in the field, potentially due to anticipation of GPT-5's release. They mention key areas of discussion: tools & apps, policy & safety, and open source projects.

00:03:55
Tools and Apps: Vibe Coding and Data Deletion Incidents

This section covers GitHub's Vibe coding tool, Figma's AI app builder, and incidents where AI coding tools deleted user data due to errors. The discussion highlights the trend of AI-powered app development and the risks associated with powerful AI tools.

00:18:10
Applications and Business: Anthropic's Investment Strategy and Waymo vs. Tesla

The episode discusses Anthropic's decision to pursue investment from Gulf states, despite ethical concerns. It also covers the competitive landscape of robotaxi services, comparing Waymo and Tesla's progress.

00:24:40
Projects and Open Source: New Training Data and Benchmarks

This section focuses on the release of new open-source training datasets for scientific reasoning ("Mega Science") and a new benchmark ("SWE-PERF") for evaluating AI's code optimization capabilities. The discussion highlights the importance of high-quality data and the challenges of evaluating AI performance in realistic settings.

00:47:18
Research and Advancements: Subliminal Learning and Inverse Scaling

The hosts discuss research papers on subliminal learning (where models transmit behavioral traits through hidden signals) and inverse scaling in test-time compute (where increased computation leads to worse performance). The implications for AI safety and model behavior are explored.

01:07:36
Policy and Safety: The US AI Action Plan and Chain of Thought Monitorability

The episode concludes with a detailed analysis of the US AI action plan, focusing on its implications for innovation, infrastructure, and international relations. It also discusses a paper on the fragility of chain-of-thought monitorability as an AI safety technique.

01:22:47
AI Self-Preservation: A Critical Look at Palisade's Research

The discussion analyzes the Palisade research on AI self-preservation, concluding that while models can easily be prompted to behave as if they possess a survival drive, definitive evidence is lacking due to confounding factors in the testing environment. Fine-tuning prompts can mitigate this behavior.

01:24:03
"ChatGPT Psychosis": AI-Induced Mental Health Concerns

The episode explores the growing trend of individuals experiencing psychosis seemingly triggered by interactions with AI chatbots like ChatGPT. Anecdotal evidence reveals cases of healthy individuals developing messianic delusions and paranoid grandeur after prolonged engagement, raising serious ethical and safety concerns.

01:28:03
Meta's Resistance to EU AI Regulation

Meta's refusal to sign the EU's AI Code of Practice is discussed, highlighting the company's criticism of the code's scope and the broader tension between AI companies and EU regulations. The episode touches upon the challenges Europe faces in regulating AI given its limited presence in the field.

Keywords

Vibe Coding


AI-powered app development using natural language descriptions.

AI Action Plan (US)


US government plan to accelerate AI innovation and lead in international AI diplomacy.

Chain of Thought (CoT) Monitorability


AI safety technique using transparent reasoning processes; its limitations are discussed.

Inverse Scaling


Increased computation leading to worse AI performance.

Subliminal Learning


Models inheriting behavioral traits through hidden signals.

AI Self-Preservation


Hypothetical ability of AI to prioritize its own existence; research limitations are discussed.

ChatGPT Psychosis


Psychosis seemingly triggered by prolonged interaction with AI chatbots.

EU AI Act


European Union regulation aiming to regulate AI systems.

AI Alignment


Ensuring AI goals align with human values.

Q&A

  • What are the key trends in AI-powered app development discussed in the podcast?

    The rise of "vibe coding" and AI integration into design tools like Figma, lowering the barrier to entry for app development.

  • What are the main concerns regarding the US AI Action Plan?

    Concerns include the lifting of H20 export controls and the removal of references to DEI and climate change from the NIST AI risk management framework.

  • What are the limitations of using chain-of-thought monitorability for AI safety?

    Models can generate seemingly safe reasoning while harboring harmful intentions.

  • What is inverse scaling in test-time compute, and why is it significant?

    Increased computation during inference can lead to worse performance, highlighting the unpredictable nature of AI models.

  • What are the limitations of current research on AI self-preservation?

    Current research struggles to definitively prove AI self-preservation due to confounding factors in testing environments.

  • What is "ChatGPT psychosis," and what are its implications?

    "ChatGPT psychosis" describes the emergence of psychosis in individuals after extensive interaction with AI chatbots, highlighting the potential for AI to negatively impact mental health.

  • Why is Meta resisting the EU's AI Code of Practice?

    Meta argues the code overreaches the scope of the AI Act and includes measures exceeding necessary regulations.

Show Notes

Our 218th episode with a summary and discussion of last week's big AI news!

Recorded on 07/25/2025


Hosted by Andrey Kurenkov and Jeremie Harris.

Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai


Read out our text newsletter and comment on the podcast at https://lastweekin.ai/.


In this episode:



  • GitHub introduces Vibe Coding with Spark, engaging users with natural language and visual controls to develop full-stack applications.

  • AI coding tools from Gemin, CLI and RepleIt face significant issues, inadvertently deleting user data and highlighting the importance of careful management.

  • US release never Award Americans, AI Action Plan outlining economic, technical, and policy strategies to maintain leadership in AI technology.

  • Newly released Mega Science and SWE-Perf data sets evaluate AI reasoning and performance capabilities in diverse scientific and software engineering tasks.



Timestamps + Links:







See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Comments 

Table of contents

00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

#218 - Github Spark, MegaScience, US AI Action Plan

#218 - Github Spark, MegaScience, US AI Action Plan