InfoQ Homepage Large language models Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

The Data Backbone of LLM Systems

Paul Iusztin discusses the evolution of AI engineering, highlighting the shift from model training to foundational models. He shares insights on scalable LLM systems and optimizing RAG.

Paul Iusztin
on Sep 10, 2025

Icon

51:25
AI, ML & Data Engineering

Enhance LLMs’ Explainability and Trustworthiness with Knowledge Graphs

Leann Chen discusses how knowledge graphs provide structured data to enhance LLM accuracy, tackling common challenges like hallucinations and the "lost-in-the-middle" phenomenon in RAG systems.

Leann Chen
on Jul 22, 2025

Icon

52:25
AI, ML & Data Engineering

AI Agents & LLMs: Scaling the Next Wave of Automation

The panelists discuss AI agents and LLMs, exploring their definitions, architectures, use cases, reliability, and impact on the SDLC and future of automation.

Govind Kamtamneni Hien Luu Karthik Ramgopal Srini Penchikala
on Jul 09, 2025

Icon

01:02:48
AI, ML & Data Engineering

A Framework for Building Micro Metrics for LLM System Evaluation

Denys Linkov discusses critical lessons for senior developers and leaders on building robust LLM systems and actionable metrics that prevent production issues and drive business value.

Denys Linkov
on Jul 01, 2025

Icon

29:10
AI, ML & Data Engineering

Scaling Large Language Model Serving Infrastructure at Meta

Ye (Charlotte) Qi explains key considerations for optimizing LLM inference, including hardware, latency, and production scaling strategies.

Ye Qi
on May 29, 2025

Icon

49:59
AI, ML & Data Engineering

How Green is Green: LLMs to Understand Climate Disclosure at Scale

Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.

Leo Browning
on Apr 22, 2025

Icon

47:29
AI, ML & Data Engineering

LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries

Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.

Stefania Chaplin Azhir Mahmood
on Apr 17, 2025

Icon

43:50
AI, ML & Data Engineering

Unleashing Llama's Potential: CPU-Based Fine-Tuning

Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.

Anil Rajput Rema Hariharan
on Apr 07, 2025

Icon

48:11
AI, ML & Data Engineering

Navigating LLM Deployment: Tips, Tricks, and Techniques

Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.

Meryem Arik
on Mar 28, 2025

Icon

39:49
Architecture & Design

How GitHub Copilot Serves 400 Million Completion Requests a Day

David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.

David Cheney
on Mar 24, 2025

Icon

49:24
AI, ML & Data Engineering

Leveraging Open-source LLMs for Production

Andrey Cheptsov discusses the practical use of open-source LLMs for real-world applications, weighing their pros and cons, highlighting advantages like privacy and cost-efficiency.

Andrey Cheptsov
on Feb 12, 2025

Icon

44:16
AI, ML & Data Engineering

Taking LLMs out of the Black Box: A Practical Guide to Human-in-the-Loop Distillation

Ines Montani discusses practical solutions for using the latest LLMs in real-world applications and explores how to distill knowledge into smaller and faster components.

Ines Montani
on Feb 05, 2025

Icon

49:47

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations