InfoQ Homepage Large language models Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

Fine Tuning the Enterprise: Reinforcement Learning in Practice

Will Hang and Wenjie Zi explain how OpenAI's Agent RFT (Reinforcement Fine-Tuning) optimizes reasoning models end-to-end, enabling autonomous agents to learn efficient tool use and cut latencies.

Wenjie Zi Will Hang
on Jul 03, 2026

Icon

48:47
AI, ML & Data Engineering

Graph RAG: Building Smarter Retrieval Workflows with Knowledge Graphs

Cassie Shum explains how to move beyond traditional RAG limits using GraphRAG to inject enterprise logic and complex multi-hop reasoning into scalable knowledge graphs within a data warehouse.

Cassie Shum
on Jul 01, 2026

Icon

50:54
AI, ML & Data Engineering

Rules for Understanding Language Models

Naomi Saphra explains why language models act like populations rather than individuals. She discusses how data diversity drives generalization, why LLMs mirror users and how tokenizers alter behavior.

Naomi Saphra
on Jun 24, 2026

Icon

40:28
AI, ML & Data Engineering

From Hype to Strong Foundations: What the Rise, Fall and Resurgence of Agents Can Teach Us about Outlasting the Cycle

Aditya Kumarakrishnan discusses "Agents: The Missing Manual," sharing four historically grounded ideas to build modular, durable, and hyper-tenant AI agent architectures.

Aditya Kumarakrishnan
on Jun 17, 2026

Icon

50:24
AI, ML & Data Engineering

Building Evals for AI Adoption: from Principles to Practice

Mallika Rao explains how evaluation debt silently triggers regressions in distributed AI systems. She shares a five-layer evaluation stack to align metrics directly with long-term user trust.

Mallika Rao
on May 29, 2026

Icon

52:58
Culture & Methods

Accelerating LLM-Driven Developer Productivity at Zoox

Amit Navindgi explains how Zoox built Cortex, an internal AI platform that streamlines the developer lifecycle by moving beyond the hype to deliver secure, agentic workflows and real-world impact.

Amit Navindgi
on May 14, 2026

Icon

50:06
AI, ML & Data Engineering

Dynamic Moments: Weaving LLMs into Deep Personalization at DoorDash

Sudeep Das and Pradeep Muthukrishnan discuss how DoorDash combines LLMs with deep learning to move from "classic" collaborative filtering to "hyper-personalization" in real-time commerce.

Sudeep Das Pradeep Muthukrishnan
on Apr 21, 2026

Icon

47:02
AI, ML & Data Engineering

Building Embedding Models for Large-Scale Real-World Applications

Sahil Dua explains the architecture and training of embedding models. He shares practical tips for distilling large models and scaling RAG applications for real-time production environments.

Sahil Dua
on Feb 13, 2026

Icon

44:06
AI, ML & Data Engineering

Ecologies and Economics of Language AI in Practice

Jade Abbott explains how to build sustainable AI using "Little LMs." She discusses environmental impacts, linguistic justice, and technical optimizations like quantization and model distillation.

Jade Abbott
on Dec 24, 2025

Icon

50:19
AI, ML & Data Engineering

Why Observability Matters (More!) with AI Applications

Sally O'Malley shares how to build an AI observability stack with open-source tools (Prometheus, Grafana, OpenTelemetry, Tempo, vLLM/Llama Stack). Learn to track performance, quality and cost signals.

Sally O'Malley
on Oct 20, 2025

Icon

51:05
AI, ML & Data Engineering

Deploy MultiModal RAG Systems with vLLM

Stephen Batifol explains the core concepts of multimodal RAG systems, vector search indexes (HNSW, IVF), and embedding model selection. He details vLLM and Pixtral for optimized inference..

Stephen Batifol
on Oct 10, 2025

Icon

47:26
AI, ML & Data Engineering

Chatting with Your Knowledge Graph

Jonathan Lowe discusses how to enable an LLM to chat with a structured graph database. He explains the process of using semantic search and knowledge graphs to answer natural language questions.

Jonathan Lowe
on Sep 15, 2025

Icon

50:20

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations