This show is your guidebook to building scalable and maintainable AI systems. You will learn how to architect AI applications, apply AI to your work, and the considerations involved in building or customizing new models. Everything that you need to know to deliver real impact and value with machine learning and artificial intelligence.
Support the show!Listen in your favorite app:
FountainHere are shows you might like
Summary In this episode of the AI Engineering Podcast Ron Green, co-founder and CTO of KungFu AI, talks about the evolving landscape of AI systems and the challenges of harnessing generative AI engines. Ron shares his insights on the limitations of large language models (LLMs) as standalone solutions and emphasizes the need for human oversight,…
Summary In this episode of the AI Engineering Podcast Ron Green, co-founder and CTO of KungFu AI,…
16 December 2024 | 00:55:13
Summary In this episode of the AI Engineering Podcast Jim Olsen, CTO of ModelOp, talks about the governance of generative AI models and applications. Jim shares his extensive experience in software engineering and machine learning, highlighting the importance of governance in high-risk applications like healthcare. He explains that governance is…
Summary In this episode of the AI Engineering Podcast Jim Olsen, CTO of ModelOp, talks about the…
01 December 2024 | 00:54:19
Summary In this episode of the AI Engineering Podcast, Vasilije Markovich talks about enhancing Large Language Models (LLMs) with memory to improve their accuracy. He discusses the concept of memory in LLMs, which involves managing context windows to enhance reasoning without the high costs of traditional training methods. He explains the…
Summary In this episode of the AI Engineering Podcast, Vasilije Markovich talks about enhancing…
25 November 2024 | 00:55:01
Summary In this episode of the AI Engineering Podcast, Tanner Burson, VP of Engineering at Prismatic, talks about the evolving impact of generative AI on software developers. Tanner shares his insights from engineering leadership and data engineering initiatives, discussing how AI is blurring the lines of developer roles and the strategic value of…
Summary In this episode of the AI Engineering Podcast, Tanner Burson, VP of Engineering at…
22 November 2024 | 00:52:58
Summary Machine learning workflows have long been complex and difficult to operationalize. They are often characterized by a period of research, resulting in an artifact that gets passed to another engineer or team to prepare for running in production. The MLOps category of tools have tried to build a new set of utilities to reduce that friction,…
Summary Machine learning workflows have long been complex and difficult to operationalize. They are…
11 November 2024 | 01:16:12
Summary With the growth of vector data as a core element of any AI application comes the need to keep those vectors up to date. When you go beyond prototypes and into production you will need a way to continue experimenting with new embedding models, chunking strategies, etc. You will also need a way to keep the embeddings up to date as your data…
Summary With the growth of vector data as a core element of any AI application comes the need to…
11 November 2024 | 00:53:50
Summary In this episode Philip Kiely from BaseTen talks about the intricacies of running open models in production. Philip shares his journey into AI and ML engineering, highlighting the importance of understanding product-level requirements and selecting the right model for deployment. The conversation covers the operational aspects of deploying…
Summary In this episode Philip Kiely from BaseTen talks about the intricacies of running open models…
28 October 2024 | 00:57:37
Summary In this episode of the AI Engineering podcast, Philip Rathle, CTO of Neo4J, talks about the intersection of knowledge graphs and AI retrieval systems, specifically Retrieval Augmented Generation (RAG). He delves into GraphRAG, a novel approach that combines knowledge graphs with vector-based similarity search to enhance generative AI…
Summary In this episode of the AI Engineering podcast, Philip Rathle, CTO of Neo4J, talks about the…
10 September 2024 | 00:59:06
Summary In this episode of the AI Engineering podcast Praveen Gujar, Director of Product at LinkedIn, talks about the applications of generative AI in digital advertising. He highlights the key areas of digital advertising, including audience targeting, content creation, and ROI measurement, and delves into how generative AI is revolutionizing…
Summary In this episode of the AI Engineering podcast Praveen Gujar, Director of Product at…
02 September 2024 | 00:41:49
Summary In this episode of the AI Engineering podcast, host Tobias Macy interviews Tammer Saleh, founder of SuperOrbital, about the potentials and pitfalls of using Kubernetes for machine learning workloads. The conversation delves into the specific needs of machine learning workflows, such as model tracking, versioning, and the use of Jupyter…
Summary In this episode of the AI Engineering podcast, host Tobias Macy interviews Tammer Saleh,…
15 August 2024 | 00:50:22