Super Data Science: ML & AI Podcast with Jon Krohn

776 Episodes

Reverse

775: What will humans do when machines are vastly more intelligent? With Aleksa Gordić

2024-04-1601:36:41

Tech entrepreneurship, artificial superintelligence, and the future of education: Aleksa Gordić speaks to Jon Krohn about his strategies for self-directed learning, the traits that help people succeed in moving from big tech to entrepreneurship, and the social impact of artificial superintelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to motivate yourself to become a tech entrepreneur [17:02] • Aleksa’s checklist for the perfect CTO [35:00] • Potential sustainable solutions for LLMs [41:51] • The next major developments in AI and tech [48:29] • How hobbies have a knock-on effect for a person’s career [1:01:53] • How and why formal education needs to change [1:09:24] Additional materials: www.superdatascience.com/775

774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities

2024-04-1212:52

Covariant's RFM-1: Jon Krohn explores the future of AI-driven robotics with RFM-1, a groundbreaking robot arm designed by Covariant and discussed by A.I. roboticist Pieter Abbeel. Explore how this innovation aims to merge digital intelligence with the physical world, promising a new era of efficiency and autonomy. Additional materials: www.superdatascience.com/774 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

773: Deep Reinforcement Learning for Maximizing Profits, with Prof. Barrett Thomas

2024-04-0901:07:40

Dr. Barrett Thomas, an award-winning Research Professor at the University of Iowa, explores the intricacies of Markov decision processes and their connection to Deep Reinforcement Learning. Discover how these concepts are applied in operations research to enhance business efficiency and drive innovations in same-day delivery and autonomous transportation systems. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Barrett's start in operations logistics [02:27] • Concorde Solver and the traveling salesperson problem [09:59] • Cross-function approximation explained [19:13] • How Markov decision processes relate to deep reinforcement learning [26:08] • Understanding policy in decision-making contexts [33:40] • Revolutionizing supply chains and transportation with aerial drones [46:47] • Barrett’s career evolution: past changes and future prospects [52:19] Additional materials: www.superdatascience.com/773

772: In Case You Missed It in March 2024

2024-04-0524:00

Pytorch benefits, how to get funding for your AI startup, and managing scientific silos: In our new series for SuperDataScience, “In Case You Missed It”, host Jon Krohn engages in some “reinforcement learning through human feedback” of his own with need-to-hear sound bites from past SDS episodes! Additional materials: www.superdatascience.com/772 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

771: Gradient Boosting: XGBoost, LightGBM and CatBoost, with Kirill Eremenko

2024-04-0201:55:27

Kirill Eremenko joins Jon Krohn for another exclusive, in-depth teaser for a new course just released on the SuperDataScience platform, “Machine Learning Level 2”. Kirill walks listeners through why decision trees and random forests are fruitful for businesses, and he offers hands-on walkthroughs for the three leading gradient-boosting algorithms today: XGBoost, LightGBM, and CatBoost. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about decision trees [09:28] • All about ensemble models [22:03] • All about AdaBoost [38:46] • All about gradient boosting [46:51] • Gradient boosting for classification problems [1:01:26] • All about XGBoost, LightGBM and CatBoost [1:04:12] Additional materials: www.superdatascience.com/771

770: The Neuroscientific Guide to Confidence

2024-03-2945:22

Explore the science of confidence with Lucy Antrobus, as she unveils neuroscience-backed strategies to build and boost confidence through practice, positive energy, and the power of laughter. An essential listen for fostering unshakable self-assurance. Additional materials: www.superdatascience.com/770 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

769: Generative AI for Medicine, with Prof. Zack Lipton

2024-03-2601:49:12

Generative AI in medicine takes center stage as Prof. Zachary Lipton, Chief Scientific Officer at Abridge, joins host Jon Krohn to discuss the significant advancements in AI that are reshaping healthcare. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • The inspiration for Zack to get started in ML and healthcare [03:56] • The hardware required to use Abridge [12:29] • The key data science projects at Abridge right now [35:05] • Abridge's tech stack [59:54] • How Abridge ensures reliability in a high-stakes setting like healthcare [1:07:29] • How Zack’s academic research cross-pollinates with his commercial ML projects [1:21:05] • How Zack’s jazz background molded his entrepreneur and data science journey [1:30:32] Additional materials: www.superdatascience.com/769

768: Is Claude 3 Better than GPT-4?

2024-03-2212:55

Claude 3, LLMs and testing ML performance: Jon Krohn tests out Anthropic’s new model family, Claude 3, which includes the Haiku, Sonnet and Opus models (written in order of their performance power, from least to greatest). Can it stand shoulder to shoulder with other models such as GPT-4 and Gemini 1.0 Ultra? And how important is it for machine learning practitioners to try out these models with their own benchmarks? Jon walks listeners through a test of his own in this Five-Minute Friday. Additional materials: www.superdatascience.com/768 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

767: Open-Source LLM Libraries and Techniques, with Dr. Sebastian Raschka

2024-03-1901:48:12

Jon Krohn sits down with Sebastian Raschka to discuss his latest book, Machine Learning Q and AI, the open-source libraries developed by Lightning AI, how to exploit the greatest opportunities for LLM development, and what’s on the horizon for LLMs. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about Machine Learning Q and AI [04:13] • Sebastian Raschka’s role as Staff Research Engineer at Lightning AI [19:21] • PyTorch Lightning’s and Lightning Fabric’s capabilities [39:32] • Large language models: Opportunities and challenges [43:35] • DoRA vs LoRA [48:56] • How to be a successful AI educator [1:34:18] Additional materials: www.superdatascience.com/767

766: Vonnegut's Player Piano (1952): An Eerie Novel on the Current AI Revolution

2024-03-1508:13

Kurt Vonnegut's "Player Piano" delivers striking parallels between its dystopian vision and today's AI challenges. This week, Jon Krohn explores the novel's depiction of a world where humans are marginalized by machines, reflecting on the impact of automation on society and the ethical considerations it raises. Tune in as we unpack the timeless relevance of Vonnegut's work to the AI era. Additional materials: www.superdatascience.com/766 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

765: NumPy, SciPy and the Economics of Open-Source, with Dr. Travis Oliphant

2024-03-1201:37:291

Explore the origins of NumPy and SciPy with their creator, Dr. Travis Oliphant. Discover the journey from personal need to global impact, the challenges overcome, and the future of these essential Python libraries in scientific computing and data science. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), by Data Universe, the out-of-this-world data conference (https://datauniverse2024.com), and by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Travis's journey to creating NumPy and SciPy [08:05] • How Anaconda got started [42:24] • How Numba, a high-performance Python compiler, was brought to market [54:48] • Python's influence on the thought processes of scientists and engineers [1:04:21] • The commercial projects that support Travis’s vast open-source efforts and communities [1:10:22] • How to get involved in Travis's commercial projects and communities [1:22:34] • The future of scientific computing and Python libraries [1:29:50] Additional materials: www.superdatascience.com/765

764: The Top 10 Episodes of 2023

2024-03-0808:041

Data science futurists, bestselling authors, and lively how-to guides from the industry’s top practitioners, which range from applying data science for good to using open-source tools for NLP: This is The Super Data Science Podcast’s top ten most listened-to episodes in 2023, hosted by Jon Krohn. A great snapshot of our great content from 2023. Additional materials: www.superdatascience.com/764 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

763: The Best A.I. Startup Opportunities, with venture capitalist Rudina Seseri

2024-03-0501:27:14

At Glasswing Ventures, Rudina Seseri wants to be able to answer the question: What has Glasswing Ventures done for the company beyond capital investment? She speaks to Jon Krohn about how her company uses data to assess venture capital investments, the secret sauce of successful AI startups, and why she feels generative AI is only the start of a much broader impact that AI will make in communities and businesses. This episode is brought to you by the DataConnect Conference (https://www.dataconnectconf.com/dccwest/conference), and by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Potential interest areas for Series A AI venture capitalists [12:22] • How Glasswing’s AI Palette helps AI startups [23:06] • How data driven the venture capital industry is [27:21] • Advice for adopting services from AI providers [47:21] • Model collapse: Causes and concerns [58:44] • Glasswing’s checklist for AI startups [1:04:59] Additional materials: www.superdatascience.com/763

762: Gemini 1.5 Pro, the Million-Token-Context LLM

2024-03-0116:58

Jon Krohn presents an insightful overview of Google's groundbreaking Gemini Pro 1.5, a million-token LLM that's transforming the landscape of AI. Discover the innovative aspects of Gemini Pro 1.5, from its extensive context window to its multimodal functionalities, which are broadening the scope of AI technology and signifying a significant leap in data science. Plus, join Jon for a practical demonstration, showcasing the real-world applications, capabilities, and limitation of this advanced language model. Additional materials: www.superdatascience.com/762 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

761: Gemini Ultra: How to Release an A.I. Product for Billions of Users, with Google's Lisa Cohen

2024-02-2701:10:15

Google's Gemini Ultra takes the spotlight this week, as host Jon Krohn welcomes Lisa Cohen, Google's Director of Data Science and Engineering, for a conversation about the launch of Gemini Ultra. Discover the capabilities of this cutting-edge large language model and how it stands toe-to-toe with GPT-4. Lisa shares her insights on the development, rollout, and potential of Gemini Ultra in reshaping various sectors. Whether you're a data science professional, tech enthusiast, or curious about the future of AI, this episode offers a deep dive into one of the most significant advancements in artificial intelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), and by Intel and HPE Ezmeral Software Solutions (https://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Google’s Gemini model family and Lisa's key responsibilities [04:55] • How LLMs will transform the practice of Data Science [19:47] • Lisa on prompt engineering and reinforcement learning from human feedback [24:38] • How to fine-tune Gemini models with Google's Vertex AI [30:52] • How AI-assistants will transform life and work for everyone from data scientists to educators to children [47:14] • The challenges of developing a data-centric culture [57:31] • Centralized vs decentralized data science teams [1:03:50] Additional materials: www.superdatascience.com/761

760: Humans Love A.I.-Crafted Beer

2024-02-2306:31

AI-crafted beer, machine learning for passion projects, and self-taught data science: Jon Krohn and Beau Warren’s hotly anticipated, data-driven, punny lager Krohn&Borg is finally given a taste test in this week’s Five-Minute Friday. Heading to the Species X brewery in Columbus, Ohio, Jon Krohn and Beau Warren launched the beer that had been predicted, optimized and developed by a machine-learning model. Additional materials: www.superdatascience.com/760 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko

2024-02-2001:43:13

Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode! This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), by Oracle NetSuite business software (netsuite.com/superdata), and by Intel and HPE Ezmeral Software Solutions (http://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How decoder-only transformers work [15:51] • How cross-attention works in transformers [41:05] • How encoders and decoders work together (an example) [52:46] • How encoder-only architectures excel at understanding natural language [1:20:34] • The importance of masking during self-attention [1:27:08] Additional materials: www.superdatascience.com/759

758: The Mamba Architecture: Superior to Transformers in LLMs

2024-02-1608:12

Explore the groundbreaking Mamba model, a potential game-changer in AI that promises to outpace the traditional Transformer architecture with its efficient, linear-time sequence modeling. Additional materials: www.superdatascience.com/758 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

757: How to Speak so You Blow Listeners' Minds, with Cole Nussbaumer Knaflic

2024-02-1301:29:031

Explore mind-blowing storytelling with Cole Nussbaumer Knaflic in this episode. Audience favorite and author of "Storytelling with You," Cole returns to share essential tips for crafting impactful presentations, emphasizing narrative construction and audience engagement. Learn how to effectively communicate data and stories, enhancing your presentations with insights from a leading expert in the field. This episode is brought to you by CloudWolf (https://www.cloudwolf.com/sds), the Cloud Skills platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to become a confident communicator [11:59] • How to get rid of filler words [26:32] • How facts alone can't make a strong impact [41:44] • Cole's overview of her book Storytelling with You [55:19] • How to craft an effective presentation [1:00:24] • Common mistakes in virtual presentations [1:09:48] • Cole's virtual presentation setup [1:15:33] • Cole's next book Daphne Draws Data [1:20:23] Additional materials: www.superdatascience.com/757

756: AlphaGeometry: AI is Suddenly as Capable as the Brightest Math Minds

2024-02-0908:45

AlphaGeometry, intuitive AI, and geometric deduction: In this week’s Five-Minute Friday, Super Data Science host Jon Krohn looks into developments from DeepMind, Google’s ground-breaking AI lab, and explores how this is a critical step towards a future of broadly accessible AI solutions across scientific disciplines. Additional materials: www.superdatascience.com/756 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.

Comments (29)

atefeh

thank you for this episode.

Mar 5th

Reply (1)

mrs rime

🔴💚Really Amazing ️You Can Try This💚WATCH💚ᗪOᗯᑎᒪOᗩᗪ👉https://co.fastmovies.org

Jan 16th

Priya Dharshini

🔴WATCH>>ᗪOᗯᑎᒪOᗩᗪ>>👉https://co.fastmovies.org

Andrew Miller

I found this podcast really helpful for anyone who wants to better their knowledge of machine learning. I am especially interested in the data processing. If you want to deepen your knowledge of this topic, check this article https://techlogitic.net/categorization-and-data-labeling-for-supervised-machine-learning/. It has some pretty useful information and professional tips from experts in data annotation and tagging.

Apr 21st

Toben Nelson

a really nice and quick overview with just the right amount of detail.

Mar 3rd

Maryam Alizadeh

great thanks to you and your endeavors for this pod. I learnt a lot. welcome to Jon , wish you the best 👏👍

Jan 4th

😢

Masoud Fard

you are the best

Nov 19th

Nikhil Parmar

nice summarisation, Data Analyts looks at the past and data scientist looks at past and future

Oct 27th

Tough Nut

Great talk, very inspiring. thanks.

Aug 6th

Venkat M

Sleeps 3 hrs a day, not a good example for healthy person. sleep well and keep the brain more refreshed and healthy. #health

Mar 13th

Mehrdad Salimi

a lot of extra, unrelated stuff. Dude I appreciate your effort but you need to be specific and respect audiences' time.

Jan 19th

Maria Lacerda

Eu não conhecia Gabriela de Queiroz mas agora ouvindo esse podcast (já ouvi umas 5x) estou completamente encantada. Muito legal descobrir esse nível de profissional pelo mundo e ainda saber que trata-se de uma brasileira.

Dec 2nd

Natalia Zawadzka

Great job!👍 It's so interesting to listen your podcasts! thanks for sharing your knowledge and helping people to get into data business 🙌👍

Sep 10th

SriLatha K

Hi thanks for doing this podcast. Being a data engineer and who commutes a lot, I gain a lot from your podcasts. One suggestion that I would like to give is, it would be better if you do not interrupt the speaker until they complete their flow.

May 10th

Alberto Andrade

What amazing episode! Adrian rocks!! Congratulations!

Apr 25th

Simon SOUVANNARAT

Thanks for this advice !

Feb 6th

Troy Kirin

Great episode! I wish he touched on how to connect Sparklyr to data viz like Tableau!

Dec 10th

Richard Leyshon

thought this was one of my stoic podcast episodes! Great message.

Dec 1st

Ari Meier

Great episode! I'd love to access the show notes, but is having an issue pulling up the link.

Nov 10th

#box-pro-ellipsis-171350390685364{-webkit-line-clamp:2;}Super Data Science: ML & AI Podcast with Jon Krohn

atefeh

mrs rime

Priya Dharshini

Andrew Miller

Toben Nelson

Maryam Alizadeh

Maryam Alizadeh

Masoud Fard

Nikhil Parmar

Tough Nut

Venkat M

Mehrdad Salimi

Maria Lacerda

Natalia Zawadzka

SriLatha K

Alberto Andrade

Simon SOUVANNARAT

Troy Kirin

Richard Leyshon

Ari Meier

Super Data Science: ML & AI Podcast with Jon Krohn