InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

Articles

RSS Feed

Newer Older

AI, ML & Data Engineering

Why Most Machine Learning Projects Fail to Reach Production

In this article, the author diagnoses common failures in ML initiatives, including weak problem framing and the persistent prototype-to-production gap. The piece provides practical, experience-based guidance on setting clear business goals, treating data as a product, and aligning cross-functional teams for reliable, production-ready ML delivery.

Wenjie Zi
on Feb 02, 2026
AI, ML & Data Engineering

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

This article introduces a reinforcement learning (RL) approach grounded in Apache Spark that enables distributed computing systems to learn optimal configurations autonomously, much like an apprentice engineer who learns by doing. The author also implements a lightweight agent as a driver-side component that uses RL to choose configuration settings before a job runs.

Hina Gandhi
on Jan 30, 2026
AI, ML & Data Engineering

Virtual Panel - AI in the Trenches: How Developers Are Rewriting the Software Process

This virtual panel brings together engineers, architects, and technical leaders to explore how AI is changing the landscape of software development. Practitioners share their insights on successes and failures when AI is incorporated into daily workflows, emphasizing the significance of context, validation, and cultural adaptation in making AI a sustainable element of modern engineering practices.

Arthur Casals Mariia Bulycheva May Walter Phil Calçado Andreas Kollegger
on Jan 26, 2026
AI, ML & Data Engineering

Article Series: AI-Assisted Development: Real World Patterns, Pitfalls, and Production Readiness

In this series, we examine what happens after the proof of concept and how AI becomes part of the software delivery pipeline. As AI transitions from proof of concept to production, teams are discovering that the challenge extends beyond model performance to include architecture, process, and accountability. This transition is redefining what constitutes good software engineering.

Arthur Casals
on Jan 21, 2026
AI, ML & Data Engineering

Agentic Terminal - How Your Terminal Comes Alive with CLI Agents

In this article author Sachin Joglekar discusses the transformation of CLI terminals becoming agentic where developers can state goals while the AI agents plan, call tools, iterate, ask for approval where needed, and execute the requests. He also explains the planning styles for three different CLI tools: Gemini, Claude, and Auto-GPT.

Sachin Joglekar
on Jan 08, 2026
Architecture & Design

Where Architects Sit in the Era of AI

As AI evolves from tool to collaborator, architects must shift from manual design to meta-design. This article introduces the "Three Loops" framework (In, On, Out) to help navigate this transition. It explores how to balance oversight with delegation, mitigate risks like skill atrophy, and design the governance structures that keep AI-augmented systems safe and aligned with human intent.

Dave Holliday João Carlos Gonçalves Manoj Kumar Yadav
on Dec 19, 2025
Architecture & Design

Architecture in a Flow of AI-Augmented Change

While AI adoption is surging, most organizations fail to scale past pilots. The solution lies in organizational structure, not just technology. This article details how architects can enable "fast flow" by defining clear domains and guardrails. Learn how to shift from controlling outcomes to curating context, allowing AI to drive continuous, valuable business change.

Jonathan McPhail Juan Medina Jake DeCrane Isuru Wijesundara
on Dec 18, 2025
AI, ML & Data Engineering

NextGen Search - Where AI Meets OpenSearch through MCP

In this article, authors Srikanth Daggumalli and Arun Lakshmanan discuss next-generation context-aware conversational search using OpenSearch and AI agents powered by Large Language Models (LLMs) and Model Context Protocol (MCP).

Srikanth Daggumalli Arun Lakshmanan
on Dec 17, 2025
AI, ML & Data Engineering

Reducing False Positives in Retrieval-Augmented Generation (RAG) Semantic Caching: a Banking Case Study

In this article, author Elakkiya Daivam discusses why Retrieval Augmented Generation (RAG) and semantic caching techniques are powerful levers for reducing false positives in AI powered applications. She shares the insights from a production-grade evaluation with 1,000 query variations tested across seven bi-encoder models.

Elakkiya Daivam
on Nov 14, 2025
AI, ML & Data Engineering

Training Data Preprocessing for Text-to-Video Models

In this article, author Aleksandr Rezanov discusses the data preparation for generative text-to-image models to accelerate work on video generation services to be used in TV series and films. He explains how data is prepared and can serve as a starting point for creating custom datasets to develop proprietary models.

Aleksandr Rezanov
on Nov 06, 2025
Culture & Methods

A Plan-Do-Check-Act Framework for AI Code Generation

AI code generation tools promise faster development but often create quality issues, integration problems, and delivery delays. A structured Plan-Do-Check-Act cycle can maintain code quality while leveraging AI capabilities. Through working agreements, structured prompts, and continuous retrospection, it asserts accountability over code while guiding AI to produce tested, maintainable software.

Ken Judy
on Oct 20, 2025
AI, ML & Data Engineering

Disaggregation in Large Language Models: the Next Evolution in AI Infrastructure

Large Language Model (LLM) inference faces a fundamental challenge: the same hardware that excels at processing input prompts struggles with generating responses, and vice versa. Disaggregated serving architectures solve this by separating these distinct computational phases, delivering throughput improvements and better resource utilization while reducing costs.

Anat Heilper
on Sep 29, 2025

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles