Large Language Models Content on InfoQ
-
Why Observability Matters (More!) with AI Applications
Sally O'Malley shares how to build an AI observability stack with open-source tools (Prometheus, Grafana, OpenTelemetry, Tempo, vLLM/Llama Stack). Learn to track performance, quality, and cost signals.
-
Deploy MultiModal RAG Systems with vLLM
Stephen Batifol explains the core concepts of multimodal RAG systems, vector search indexes (HNSW, IVF), and embedding model selection. He details using vLLM and Pixtral for optimized inference.
-
Chatting with Your Knowledge Graph
Jonathan Lowe discusses how to enable an LLM to chat with a structured graph database. He explains the process of using semantic search and knowledge graphs to answer natural language questions.
-
The Data Backbone of LLM Systems
Paul Iusztin discusses the evolution of AI engineering, highlighting the shift from model training to foundation models. He shares insights on scalable LLM systems and optimizing RAG.
-
Enhance LLMs’ Explainability and Trustworthiness with Knowledge Graphs
Leann Chen discusses how knowledge graphs provide structured data to enhance LLM accuracy, tackling common challenges like hallucinations and the "lost-in-the-middle" phenomenon in RAG systems.
-
AI Agents & LLMs: Scaling the Next Wave of Automation
The panelists discuss AI agents and LLMs, exploring their definitions, architectures, use cases, reliability, and impact on the SDLC and future of automation.
-
A Framework for Building Micro Metrics for LLM System Evaluation
Denys Linkov discusses critical lessons for senior developers and leaders on building robust LLM systems and actionable metrics that prevent production issues and drive business value.
-
Scaling Large Language Model Serving Infrastructure at Meta
Ye (Charlotte) Qi explains key considerations for optimizing LLM inference, including hardware, latency, and production scaling strategies.
-
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.