Generative AI Content on InfoQ

Articles

Development

The Technology Adoption Curve, Twenty Years On

Today, June 8th, InfoQ celebrates 20 years. This is not a comprehensive history, but a deliberately selective look at the technologies and practices InfoQ identified early, where they sit on the adoption curve in 2026, and how that curve may evolve over the next five to ten years.

Renato Losio, Dio Synodinos
on Jun 08, 2026
AI, ML & Data Engineering

Why Vector Search Alone Isn't Enough: Hybrid Retrieval for RAG

In this article, author Aaditya Chauhan discusses the limitations of RAG pipelines based purely on vector search and how an internal omni-search application that uses Reciprocal Rank Fusion (RRF) to combine BM25 and vector results can enhance the search solution.

Aaditya Chauhan
on Jun 02, 2026
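
To make the fusion step in the summary above concrete, here is a minimal sketch of Reciprocal Rank Fusion in its standard form, not the article's own implementation. The function name, the document IDs, and the constant `k=60` (a commonly used default) are assumptions for illustration.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranked list.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is an assumed default, not from the article.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists from a BM25 index and a vector index.
bm25_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_c", "doc_a", "doc_d"]
fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)
```

Because only ranks are used, the BM25 and vector scores never need to be normalized against each other; documents that rank well in both lists rise to the top.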
Cloud

Local-First AI Inference: a Cloud Architecture Pattern for Cost-Effective Document Processing

The Local-First AI Inference pattern routes 70–80% of documents to deterministic local extraction at zero API cost, reserving Azure OpenAI calls for edge cases and flagging low-confidence results for human review. Deployed on 4,700 engineering drawing PDFs, it cut API costs by 75% and processing time by 55%, while bounding errors through a human review tier.

Obinna Iheanachor
on May 11, 2026
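
The three tiers described in the summary above (local extraction, remote model call, human review) can be sketched as a simple router. This is an illustrative sketch under stated assumptions, not the article's code: the `Extraction` type, the `DWG-` pattern, the 0.8 threshold, and the stubbed remote extractor are all hypothetical, and no real Azure OpenAI call is made.

```python
import re
from dataclasses import dataclass


@dataclass
class Extraction:
    fields: dict
    confidence: float  # 0.0 to 1.0
    source: str        # "local" or "remote"


def local_extract(text):
    """Deterministic extraction with a regex (hypothetical drawing-number format)."""
    match = re.search(r"DWG-\d{4}", text)
    if match is None:
        return None
    return Extraction({"drawing_no": match.group(0)}, 0.95, "local")


def remote_extract_stub(text):
    """Stand-in for a hosted model call; always returns a low-confidence result."""
    return Extraction({"drawing_no": None}, 0.4, "remote")


def route_document(text, local, remote, accept_threshold=0.8):
    """Return (extraction, needs_human_review) using a local-first policy."""
    result = local(text)
    if result is not None and result.confidence >= accept_threshold:
        return result, False  # accepted locally, no API cost
    result = remote(text)     # edge case: fall back to the remote model
    return result, result.confidence < accept_threshold


easy, easy_review = route_document("Title block: DWG-1234", local_extract, remote_extract_stub)
hard, hard_review = route_document("illegible scan", local_extract, remote_extract_stub)
print(easy.source, easy_review, hard.source, hard_review)
```

The remote call sits behind the local check, so API cost is incurred only for documents the deterministic path cannot handle confidently.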
DevOps

From Alert Fatigue to Agent-Assisted Intelligent Observability

As systems grow, observability becomes harder to maintain and incidents harder to diagnose. Agentic observability layers AI on existing tools, starting in read-only mode to detect anomalies and summarize issues. Over time, agents add context, correlate signals, and automate low-risk tasks. This approach frees engineers to focus on analysis and judgment.

Rohit Dhawan
on Feb 04, 2026
Architecture & Design

Spec Driven Development: When Architecture Becomes Executable

Spec-Driven Development inverts traditional architecture by making specifications executable and authoritative. It transforms declared intent into validated code through AI generation and provides architectural determinism. It eliminates drift through continuous enforcement, but demands new engineering discipline in schema design and contract-first reasoning.

Leigh Griffin, Ray Carroll
on Jan 12, 2026
AI, ML & Data Engineering

Agentic Terminal - How Your Terminal Comes Alive with CLI Agents

In this article, author Sachin Joglekar discusses how CLI terminals are becoming agentic: developers state goals while AI agents plan, call tools, iterate, ask for approval where needed, and execute the requests. He also explains the planning styles of three CLI tools: Gemini, Claude, and Auto-GPT.

Sachin Joglekar
on Jan 08, 2026
AI, ML & Data Engineering

InfoQ AI, ML and Data Engineering Trends Report - 2025

This InfoQ Trends Report offers readers a comprehensive overview of emerging trends and technologies in the areas of AI, ML, and Data Engineering. It summarizes the views of the InfoQ editorial team and external guests on current trends in AI and ML technologies and what to look out for in the next 12 months.

Srini Penchikala, Savannah Kunovsky, Anthony Alford, Daniel Dominguez, Vinod Goje
on Sep 24, 2025
AI, ML & Data Engineering

Domain-Driven RAG: Building Accurate Enterprise Knowledge Systems through Distributed Ownership

Retrieval-augmented generation (RAG) can help reduce LLM hallucination. Learn how applying high-quality metadata and distributing ownership of documents and prompts to domain experts can further increase accuracy in RAG applications. An additional layer of intelligence can use metadata to focus RAG searches on a specific domain for even better results.

George Panagiotopoulos
on May 06, 2025
AI, ML & Data Engineering

Beyond Chatbots: Architecting Domain-Specific Generative AI for Operational Decision-Making

This article explores domain-specific Generative AI: models that understand operational constraints, real-world dynamics, and business rules, and that generate executable strategies rather than just text descriptions. These models require significantly smaller datasets and fewer parameters, making them cost-effective while enabling AI-driven core business decision intelligence at scale.

Abhishek Goswami
on Apr 02, 2025
AI, ML & Data Engineering

Building Trust in AI: Security and Risks in Highly Regulated Industries

Explore the transformative power of responsible AI across industries, emphasizing security, MLOps, and compliance. As AI drives innovation—from predicting hurricanes to enhancing legal workflows—organizations must prioritize ethical practices, transparency, and robust governance to safeguard sensitive data while navigating an evolving regulatory landscape.

Stefania Chaplin, Azhir Mahmood
on Feb 10, 2025
AI, ML & Data Engineering

Launching GenAI Productivity Tools: Insights and Lessons

In this article, based on a talk at QCon San Francisco 2024, author Mandy Gu shares some of the ways her company uses GenAI to enhance productivity and the lessons they learned along the way, including failed bets and features that were rolled back because of low user adoption. Most important, they learned to focus on building tools that were aligned with business goals.

Mandy Gu
on Feb 06, 2025
AI, ML & Data Engineering

Elevate Developer Experience with Generative AI Capabilities on AWS

This is a summary of a talk I gave at InfoQ Dev Summit Munich 2024 on the transformative potential of generative AI in enhancing developer experiences, particularly through AWS. In this article, I introduce key tools like Amazon Bedrock, Code Review Assistant, Agentic Code Generation, and Code Summarization.

Olalekan Elesin
on Jan 27, 2025
