DiscoverSuper Data Science: ML & AI Podcast with Jon Krohn
Super Data Science: ML & AI Podcast with Jon Krohn
Claim Ownership

Super Data Science: ML & AI Podcast with Jon Krohn

Author: Jon Krohn

Subscribed: 12,244Played: 429,189
Share

Description

The latest machine learning, A.I., and data career topics from across both academia and industry are brought to you by host Dr. Jon Krohn on the Super Data Science Podcast.

As the quantity of data on our planet doubles every couple of years and with this trend set to continue for decades to come, there's an unprecedented opportunity for you to make a meaningful impact in your lifetime. In conversation with the biggest names in the data science industry, Jon cuts through hype to fuel that professional impact.

Whether you're curious about getting started in a data career or you're a deep technical expert, whether you'd like to understand what A.I. is or you'd like to integrate more data-driven processes into your business, we have inspiring guests and lighthearted conversation for you to enjoy.

We cover tools, techniques, and implementation tricks across data collection, databases, analytics, predictive modeling, visualization, software engineering, real-world applications, commercialization, and entrepreneurship − everything you need to crush it with data science.
793 Episodes
Reverse
Jon Krohn shares his favorite clips from May. Hear how Navdeep Martin is spearheading a company to tackle the climate crisis, why Sol Rashidi and Demetrios Brinkmann find nailing job titles so necessary in the fast-paced industries of tech and AI, and get the latest on embeddings with Luis Serrano. Additional materials: www.superdatascience.com/792 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Reinforcement learning through human feedback (RLHF) has come a long way. In this episode, research scientist Nathan Lambert talks to Jon Krohn about the technique’s origins of the technique. He also walks through other ways to fine-tune LLMs, and how he believes generative AI might democratize education. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0), and Crawlbase (https://crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Why it is important that AI is open [03:13] • The efficacy and scalability of direct preference optimization [07:32] • Robotics and LLMs [14:32] • The challenges to aligning reward models with human preferences [23:00] • How to make sure AI’s decision making on preferences reflect desirable behavior [28:52] • Why Nathan believes AI is closer to alchemy than science [37:38] Additional materials: www.superdatascience.com/791
The experts reveal their top open-source R libraries with us live from the New York R Conference! This Super Data Science Podcast episode features an exclusive panel with data science trailblazers Drew Conway, Jared Lander, Emily Zabor, and JD Long. They share their favorite R libraries and valuable insights to enhance your data science practice. Additional materials: www.superdatascience.com/790 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Machine Learning for Wind Energy is front and center in this episode as Jon Krohn is joined by Dr. Jason Yosinski, CEO of Windscape AI. Dr. Yosinski brings to light the latest ML advancements sparking significant changes in renewable energy. Tune in for a comprehensive review of these cutting-edge technologies and their expansive impact on the industry and the environment's well-being. This episode is brought to you by Crawlbase (https://crawlbase.com), the ultimate data crawling platform. Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Enhancing predictability in wind energy with ML [04:52] • Data utilization from wind turbines by energy providers [11:41] • Jason's journey into wind energy [17:55] • Landing the right startup idea [22:47] • Visualizing neural networks with the Deep Vis Toolbox [31:29] • Extreme event forecasting at Uber vs. nowcasting at Windscape AI [45:13] • Discoveries from Loss Change Allocation research [47:48] • Engaging with Jason's ML Collective [59:46] • Traits of successful AI entrepreneurs [1:10:26] Additional materials: www.superdatascience.com/789
Multi-agent systems could mark a significant turning point in generative AI. From mastering increasingly complex tasks to getting LLMs to collaborate, in this Five-Minute Friday, Jon Krohn discusses the systems that are working to bridge the remaining gaps left by the latest large language models (LLMs). Additional materials: www.superdatascience.com/788 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
MLOps, how to build an online community, and tools for scaling LLMs: In this episode, Demetrios Brinkmann speaks to Jon Krohn about the similarities and differences between LLMOps, MLOps and DevOps, and why this should matter to companies looking to hire such engineers. You will also hear how to get involved in the MLOps community wherever you are in the world, and how you can start developing great products with the available tools. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • What MLOps is [03:51] • About LLMOps [12:06] • About LlamaIndex and Ollama [18:29] • Insights from Demetrios’ MLOps survey [20:49] • Guidance for using third-party APIs [40:18] • Recommendations for building an online community in tech and AI [47:07] Additional materials: www.superdatascience.com/787
Learn about the six keys to data science success as host Jon Krohn welcomes back Kirill Eremenko, the mastermind behind SuperDataScience. Kirill shares his top insights on data science careers, from building strong portfolios to leveraging mentors and hands-on labs. With over 2.7 million students, his advice is a must-hear for aspiring and experienced data scientists alike. Additional materials: www.superdatascience.com/786 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Dr. Luis Serrano from the Serrano Academy reveals how to make Math and Quantum ML accessible, tackles the challenges of teaching A.I. to beginners, and explores the power of embeddings in enterprise applications. Explore the future of Quantum Machine Learning and the latest trends in AI, including multimodality and autonomous systems. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How math and AI can be made easy to understand [05:21] • The three major categories of learners [16:21] • Why embeddings are the most important component of LLMs [26:19] • How semantic search differs from a traditional keyword search [29:57] • The most exciting emerging application areas for AI [42:41] • The promising application areas for Quantum Machine Learning [49:18] Additional materials: www.superdatascience.com/785
Aligning LLMs: How can we teach pre-trained LLMs to hold a conversation and learn new information from each other? This was where Sinan Ozdemir began his investigation into aligning LLMs. In this episode, he talks to Jon Krohn about the limitations of definitions for LLMs, training LLMs, and whether it is possible to train an LLM without alignment. Additional materials: www.superdatascience.com/784 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Recent advances in GenAI, how to tackle the climate crisis with advanced technology, and addressing the knowledge gap in understanding AI: Jon Krohn speaks to Flypower co-founder and CEO Navdeep Martin about the advances made in GenAI, from products to applications, and how we might use AI to tackle climate change. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How the Washington Post’s recommendation systems work [03:29] • Why product leaders make great CEOs [10:36] • How Flypower uses GenAI to tackle climate change [22:13] • How Flypower identifies its customers’ most pertinent questions [30:03] • How AI might come to tackle climate change [36:52] • How to mitigate hallucination in AI models [41:04] Additional materials: www.superdatascience.com/783
Hear Jon Krohn’s favorite five clips from his April interviews. Chief Scientist at Posit PBC Hadley Wickham on the subtle differences between Python and R. Professor of Business Analytics Barrett Thomas walks through the variables that companies should consider when using drones or any other tech to improve their business operations and bottom line. Aleksa Gordić, Founder of Runa AI believes an overhaul of the current educational system is long overdue. Bernard Marr discusses the future of GenAI and its impact on the world of work. And SuperDataScience founder Kirill Eremenko gives a lively workshop on gradient boosting. Additional materials: www.superdatascience.com/782 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Sol Rashidi, a distinguished data executive who has served in C-suite roles at Fortune 100 companies, joins Jon Krohn to delve into successful enterprise AI strategies and the reasons behind the high turnover among Chief Data Officers. This episode provides an in-depth look at selecting AI projects that succeed and understanding the strategic value of patents in various industries. Benefit from Sol’s extensive experience and practical advice on navigating complex corporate challenges. This episode is brought to you by AWS Inferentia (https://go.aws/3zWS0au) and AWS Trainium (https://go.aws/3ycV6K0). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How CDOs and related roles have such high turnover because [09:40] • The importance of building relationships in AI projects [17:01] • How Sol's book "The AI Survival Guide" came about [20:44] • How high-criticality, low-complexity AI projects are the ones with the highest probability of success [27:11] • How Enterprise data security issues can be resolved with technologies like Protopia’s stained-glass data-masking solution [36:10] • Why having great data engineers is essential [47:57] • The value of patents [51:45] Additional materials: www.superdatascience.com/781
Want to become a data scientist? Jon and Adam discuss the key steps to becoming a data scientist, with a focus on developing portfolio projects. Hear about the 10 project ideas Adam recommends in his book to help you stand out in the data science community. Additional materials: www.superdatascience.com/780 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Tidyverse, ggplot2, and the secret to a tech company’s longevity: Hadley Wickham talks to Jon Krohn about Posit’s rebrand, Tidyverse and why it needs to be in every data scientist’s toolkit, and why getting your hands dirty with open-source projects can be so lucrative for your career. This episode is brought to you by Intel and HPE Ezmeral Software (https://bit.ly/hpeintel). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • All about the Tidyverse [04:46] • Hadley’s favorite R libraries [17:10] • The goal of Posit [30:29] • On bringing multiple programming languages together [36:02] • The principles for a long-lasting tech company [52:10] • How Hadley developed ggplot2 [55:24] • How to contribute to the open-source community [1:05:43] Additional materials: www.superdatascience.com/779
Mixtral 8x22B is the focus on this week's Five-Minute Friday. Jon Krohn examines how this model from French AI startup Mistral leverages its mixture-of-experts architecture to redefine efficiency and specialization in AI-powered tasks. Tune in to learn about its performance benchmarks and the transformative potential of its open-source license. Additional materials: www.superdatascience.com/778 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Generative AI is reshaping our world, and Bernard Marr, world-renowned futurist and best-selling author, joins Jon Krohn to guide us through this transformation. In this episode, Bernard shares his insights on how AI is transforming industries, revolutionizing daily life, and addressing global challenges. With his extensive experience advising top organizations worldwide, he also examines the ethical considerations of AI deployment. This episode is brought to you by Intel and HPE Ezmeral Software (https://bit.ly/hpeintel). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How Generative AI will transform industries [03:55] • The evolution of Generative AI [10:19] • How will Generative AI impact daily life [16:52] • The ethical challenges of AI [18:55] • How corporations can harness Generative AI for collaboration [24:36] • Industries that will be impacted by Generative AI [32:20] • How Sora-like Generative AI systems will create highly immersive entertainment [42:16] • How Generative AI could unlock 99% of business data [53:34] Additional materials: www.superdatascience.com/777
What are the risks of AI progressing beyond a point of no return? What do we stand to gain? On this Five-Minute Friday, Jon Krohn talks ‘books’ as he outlines two nonfiction works on AI and futurism by Oxford philosopher Nick Bostrom. Listen to a breakdown of DEEP UTOPIA and SUPERINTELLIGENCE in this episode. Additional materials: www.superdatascience.com/776 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Tech entrepreneurship, artificial superintelligence, and the future of education: Aleksa Gordić speaks to Jon Krohn about his strategies for self-directed learning, the traits that help people succeed in moving from big tech to entrepreneurship, and the social impact of artificial superintelligence. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How to motivate yourself to become a tech entrepreneur [17:02] • Aleksa’s checklist for the perfect CTO [35:00] • Potential sustainable solutions for LLMs [41:51] • The next major developments in AI and tech [48:29] • How hobbies have a knock-on effect for a person’s career [1:01:53] • How and why formal education needs to change [1:09:24] Additional materials: www.superdatascience.com/775
Covariant's RFM-1: Jon Krohn explores the future of AI-driven robotics with RFM-1, a groundbreaking robot arm designed by Covariant and discussed by A.I. roboticist Pieter Abbeel. Explore how this innovation aims to merge digital intelligence with the physical world, promising a new era of efficiency and autonomy. Additional materials: www.superdatascience.com/774 Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information.
Dr. Barrett Thomas, an award-winning Research Professor at the University of Iowa, explores the intricacies of Markov decision processes and their connection to Deep Reinforcement Learning. Discover how these concepts are applied in operations research to enhance business efficiency and drive innovations in same-day delivery and autonomous transportation systems. This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/). Interested in sponsoring a SuperDataScience Podcast episode? Visit passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • Barrett's start in operations logistics [02:27] • Concorde Solver and the traveling salesperson problem [09:59] • Cross-function approximation explained [19:13] • How Markov decision processes relate to deep reinforcement learning [26:08] • Understanding policy in decision-making contexts [33:40] • Revolutionizing supply chains and transportation with aerial drones [46:47] • Barrett’s career evolution: past changes and future prospects [52:19] Additional materials: www.superdatascience.com/773
loading
Comments (29)

atefeh

thank you for this episode.

Mar 5th
Reply (1)

mrs rime

🔴💚Really Amazing ️You Can Try This💚WATCH💚ᗪOᗯᑎᒪOᗩᗪ👉https://co.fastmovies.org

Jan 16th
Reply

Priya Dharshini

🔴WATCH>>ᗪOᗯᑎᒪOᗩᗪ>>👉https://co.fastmovies.org

Jan 16th
Reply

Andrew Miller

I found this podcast really helpful for anyone who wants to better their knowledge of machine learning. I am especially interested in the data processing. If you want to deepen your knowledge of this topic, check this article https://techlogitic.net/categorization-and-data-labeling-for-supervised-machine-learning/. It has some pretty useful information and professional tips from experts in data annotation and tagging.

Apr 21st
Reply

Toben Nelson

a really nice and quick overview with just the right amount of detail.

Mar 3rd
Reply

Maryam Alizadeh

great thanks to you and your endeavors for this pod. I learnt a lot. welcome to Jon , wish you the best 👏👍

Jan 4th
Reply

Maryam Alizadeh

😢

Jan 4th
Reply

Masoud Fard

you are the best

Nov 19th
Reply

Nikhil Parmar

nice summarisation, Data Analyts looks at the past and data scientist looks at past and future

Oct 27th
Reply

Tough Nut

Great talk, very inspiring. thanks.

Aug 6th
Reply

Venkat M

Sleeps 3 hrs a day, not a good example for healthy person. sleep well and keep the brain more refreshed and healthy. #health

Mar 13th
Reply

Mehrdad Salimi

a lot of extra, unrelated stuff. Dude I appreciate your effort but you need to be specific and respect audiences' time.

Jan 19th
Reply

Maria Lacerda

Eu não conhecia Gabriela de Queiroz mas agora ouvindo esse podcast (já ouvi umas 5x) estou completamente encantada. Muito legal descobrir esse nível de profissional pelo mundo e ainda saber que trata-se de uma brasileira.

Dec 2nd
Reply

Natalia Zawadzka

Great job!👍 It's so interesting to listen your podcasts! thanks for sharing your knowledge and helping people to get into data business 🙌👍

Sep 10th
Reply

SriLatha K

Hi thanks for doing this podcast. Being a data engineer and who commutes a lot, I gain a lot from your podcasts. One suggestion that I would like to give is, it would be better if you do not interrupt the speaker until they complete their flow.

May 10th
Reply

Alberto Andrade

What amazing episode! Adrian rocks!! Congratulations!

Apr 25th
Reply

Simon SOUVANNARAT

Thanks for this advice !

Feb 6th
Reply

Troy Kirin

Great episode! I wish he touched on how to connect Sparklyr to data viz like Tableau!

Dec 10th
Reply

Richard Leyshon

thought this was one of my stoic podcast episodes! Great message.

Dec 1st
Reply

Ari Meier

Great episode! I'd love to access the show notes, but is having an issue pulling up the link.

Nov 10th
Reply