Machine Learning Content on InfoQ
-
Humans in the Loop: Engineering Leadership in a Chaotic Industry
Michelle Brush discusses engineering leadership in the age of AI/ML and automation.
-
Growing and Cultivating Strong Machine Learning Engineers
Vivek Gupta explains how to nourish and cultivate Machine Learning engineers, detailing the unique production-ML skills required for scaling, governance, and LLMOps.
-
Achieving Precision in AI: Retrieving the Right Data Using AI Agents
Adi Polak discusses achieving precision in GenAI by moving beyond RAG to Agentic RAG. She details agent patterns, feedback loops, and using data streaming architectures to scale real-time AI.
-
AI for Food Image Generation in Production: How & Why
Iaroslav Amerkhanov discusses how his team at Delivery Hero leveraged GenAI to generate food images, detailing the architecture, optimization, and business impact.
-
10 Reasons Your Multi-Agent Workflows Fail and What You Can Do about It
Victor Dibia discusses multi-agent systems, detailing how to build them with AutoGen, common failure points, and strategic approaches for senior software developers and engineering leaders.
-
Maximizing Deep Learning Performance on CPUs using Modern Architectures
Bibek Bhattarai demystifies Intel AMX, explaining how this CPU architecture accelerates deep learning workloads via low-precision matrix multiplication and efficient data handling.
-
A Framework for Building Micro Metrics for LLM System Evaluation
Denys Linkov discusses critical lessons for senior developers and leaders on building robust LLM systems and actionable metrics that prevent production issues and drive business value.
-
Supporting Diverse ML Systems at Netflix
David Berg and Romain Cledat discuss Metaflow, Netflix's ML infrastructure for diverse use cases from computer vision to recommendations.
-
From "Simple" Fine-Tuning to Your Own Mixture of Expert Models Using Open-Source Models
Sebastiano Galazzo shares practical tips and mistakes in creating custom LLMs for cost-effective AI. Learn LoRA, merging, MoE & optimization.
-
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.