AI Engineering Podcast

AI Engineering Podcast



This show is your guidebook to building scalable and maintainable AI systems. You will learn how to architect AI applications, apply AI to your work, and the considerations involved in building or customizing new models. Everything that you need to know to deliver real impact and value with machine learning and artificial intelligence.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
Data Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

55 Episodes

Harnessing The Engine Of AI - E44

Summary In this episode of the AI Engineering Podcast Ron Green, co-founder and CTO of KungFu AI, talks about the evolving landscape of AI systems and the challenges of harnessing generative AI engines. Ron shares his insights on the limitations of large language models (LLMs) as standalone solutions and emphasizes the need for human oversight,…

Summary In this episode of the AI Engineering Podcast Ron Green, co-founder and CTO of KungFu AI,…

16 December 2024 | 00:55:13


The Complex World of Generative AI Governance - E43

Summary In this episode of the AI Engineering Podcast Jim Olsen, CTO of ModelOp, talks about the governance of generative AI models and applications. Jim shares his extensive experience in software engineering and machine learning, highlighting the importance of governance in high-risk applications like healthcare. He explains that governance is…

Summary In this episode of the AI Engineering Podcast Jim Olsen, CTO of ModelOp, talks about the…

01 December 2024 | 00:54:19


Building Semantic Memory for AI With Cognee - E42

Summary In this episode of the AI Engineering Podcast, Vasilije Markovich talks about enhancing Large Language Models (LLMs) with memory to improve their accuracy. He discusses the concept of memory in LLMs, which involves managing context windows to enhance reasoning without the high costs of traditional training methods. He explains the…

Summary In this episode of the AI Engineering Podcast, Vasilije Markovich talks about enhancing…

25 November 2024 | 00:55:01


The Impact of Generative AI on Software Development - E41

Summary In this episode of the AI Engineering Podcast, Tanner Burson, VP of Engineering at Prismatic, talks about the evolving impact of generative AI on software developers. Tanner shares his insights from engineering leadership and data engineering initiatives, discussing how AI is blurring the lines of developer roles and the strategic value of…

Summary In this episode of the AI Engineering Podcast, Tanner Burson, VP of Engineering at…

22 November 2024 | 00:52:58


ML Infrastructure Without The Ops: Simplifying The ML Developer Experience With Runhouse - E40

Summary Machine learning workflows have long been complex and difficult to operationalize. They are often characterized by a period of research, resulting in an artifact that gets passed to another engineer or team to prepare for running in production. The MLOps category of tools have tried to build a new set of utilities to reduce that friction,…

Summary Machine learning workflows have long been complex and difficult to operationalize. They are…

11 November 2024 | 01:16:12


Building AI Systems on Postgres: An Inside Look at pgai Vectorizer - E39

Summary With the growth of vector data as a core element of any AI application comes the need to keep those vectors up to date. When you go beyond prototypes and into production you will need a way to continue experimenting with new embedding models, chunking strategies, etc. You will also need a way to keep the embeddings up to date as your data…

Summary With the growth of vector data as a core element of any AI application comes the need to…

11 November 2024 | 00:53:50


Running Generative AI Models In Production - E38

Summary In this episode Philip Kiely from BaseTen talks about the intricacies of running open models in production. Philip shares his journey into AI and ML engineering, highlighting the importance of understanding product-level requirements and selecting the right model for deployment. The conversation covers the operational aspects of deploying…

Summary In this episode Philip Kiely from BaseTen talks about the intricacies of running open models…

28 October 2024 | 00:57:37


Enhancing AI Retrieval with Knowledge Graphs: A Deep Dive into GraphRAG - E37

Summary In this episode of the AI Engineering podcast, Philip Rathle, CTO of Neo4J, talks about the intersection of knowledge graphs and AI retrieval systems, specifically Retrieval Augmented Generation (RAG). He delves into GraphRAG, a novel approach that combines knowledge graphs with vector-based similarity search to enhance generative AI…

Summary In this episode of the AI Engineering podcast, Philip Rathle, CTO of Neo4J, talks about the…

10 September 2024 | 00:59:06


Harnessing Generative AI for Effective Digital Advertising Campaigns - E36

Summary In this episode of the AI Engineering podcast Praveen Gujar, Director of Product at LinkedIn, talks about the applications of generative AI in digital advertising. He highlights the key areas of digital advertising, including audience targeting, content creation, and ROI measurement, and delves into how generative AI is revolutionizing…

Summary In this episode of the AI Engineering podcast Praveen Gujar, Director of Product at…

02 September 2024 | 00:41:49


Building Scalable ML Systems on Kubernetes - E35

Summary In this episode of the AI Engineering podcast, host Tobias Macy interviews Tammer Saleh, founder of SuperOrbital, about the potentials and pitfalls of using Kubernetes for machine learning workloads. The conversation delves into the specific needs of machine learning workflows, such as model tracking, versioning, and the use of Jupyter…

Summary In this episode of the AI Engineering podcast, host Tobias Macy interviews Tammer Saleh,…

15 August 2024 | 00:50:22