Articles

DevOps

Article Series: Securing the AI Stack: from Model to Production

This series provides your roadmap for the machine age, exploring how to move from vulnerable prototypes to resilient systems through layered defense, robust MLOps, and integrated governance.

Claudio Masolo
on Jun 05, 2026
AI, ML & Data Engineering

Why Vector Search Alone Isn't Enough: Hybrid Retrieval for RAG

In this article, author Aaditya Chauhan discusses the limitations of RAG pipelines based purely on vector search and shows how an internal omni-search application that uses Reciprocal Rank Fusion (RRF) to combine BM25 and vector results can improve the search solution.

Aaditya Chauhan
on Jun 02, 2026
Web Development

The AI Productivity Paradox in Test Automation: Moving beyond Structural Validation to Perception and Intent

The AI productivity paradox states that AI scales whatever abstraction it is built on. If that abstraction is structurally brittle, it scales structural brittleness. This article shows that to build a future of reliable, AI-driven test automation, we must stop scaling DOM-centric abstractions and build a new testing paradigm grounded in perception and intent.

Amanul Chowdhury, Vinay Gummadavelli
on Jun 01, 2026
Java

The Schema Proliferation Problem in Kafka and Flink Pipelines: How to Solve It

Schema proliferation builds slowly and gets expensive fast. One schema per event type feels right until there are ten tables, union queries spanning all of them, and a single field rename touching every schema. Discriminator-based schema consolidation collapses that to two tables, turning multi-table unions into a single query, while new variants are additive and don't break existing consumers.

Spoorthi Basu
on May 25, 2026
Cloud

Local-First AI Inference: a Cloud Architecture Pattern for Cost-Effective Document Processing

The Local-First AI Inference pattern routes 70–80% of documents to deterministic local extraction at zero API cost, reserving Azure OpenAI calls for edge cases and flagging low-confidence results for human review. Deployed on 4,700 engineering drawing PDFs, it cut API costs by 75% and processing time by 55%, while bounding errors through a human review tier.

Obinna Iheanachor
on May 11, 2026
.NET

Implementing the Sidecar Pattern in Microservices-Based ASP.NET Core Applications

Today's applications require monitoring, logging, configuration, etc. Each of these concerns can be implemented as a component or a service. These cross-cutting concerns can be tightly integrated into the application. While this tight coupling ensures effective use of shared resources, an outage in any of these components can take your application down. Enter the sidecar design pattern.

Joydip Kanjilal
on May 08, 2026
Java

MCP in the Java World: Bringing Architectural Strategy to LLM Integrations

Discover how the Model Context Protocol (MCP) Java SDK is establishing a new architectural discipline for enterprise LLM integrations. By defining explicit contracts and leveraging MCP servers as anti-corruption layers, it ensures governance, loose coupling, and security alignment with the JVM ecosystem and existing operational practices, moving integrations beyond fragility to resilience.

Matteo Rossi
on Apr 27, 2026
AI, ML & Data Engineering

Orchestrating Agentic and Multimodal AI Pipelines with Apache Camel

In this article, author Vignesh Durai discusses how agentic and multimodal AI systems can be engineered using Apache Camel and LangChain4j technologies. The key components in the solution include LLM-based reasoning, retrieval-augmented generation (RAG), and image classification.

Vignesh Durai
on Apr 24, 2026
Java

Redesigning Banking PDF Table Extraction: a Layered Approach with Java

PDF table extraction often looks easy until it fails in production. Real bank statements can be messy, with scanned pages, shifting layouts, merged cells, and wrapped rows that break standard Java parsers. This article shares how we redesigned the approach using stream parsing, lattice/OCR, validation, scoring, and selective ML to make extraction more reliable in real banking systems.

Mehuli Mukherjee
on Apr 21, 2026
Web Development

Building Production-Ready tRPC APIs: the TypeScript Alternative to Apollo Federation

This article details our migration from Apollo Federation to a TypeScript-based tRPC stack, which resulted in an 89% reduction in bugs and 67% faster response times. It also covers the mistakes we made, the unexpected performance gains, and an overview of the production architecture we use today to handle 2.4 million daily requests with 99.97% uptime.

Dinesh Kumar Elumalai
on Apr 20, 2026
Cloud

Using AWS Lambda Extensions to Run Post-Response Telemetry Flush

At Lead Bank, synchronous telemetry flushing caused intermittent exporter stalls to become user-facing 504 gateway timeouts. By leveraging AWS Lambda's Extensions API and goroutine chaining in Go, flush work is moved off the response path, returning responses immediately while preserving full observability without telemetry loss.

Melvin Philips
on Apr 15, 2026
Java

The Spring Team on Spring Framework 7 and Spring Boot 4

InfoQ recently spoke with key members of the Spring team about the significant architectural and functional advancements in Spring Framework 7 and Spring Boot 4. This conversation explores the strategic shift toward core resilience by integrating features such as retry and concurrency throttling directly into the framework, alongside the performance benefits of modularizing auto-configurations.

Karsten Silz, Phil Webb, Sam Brannen, Rossen Stoyanchev, Mark Pollack, Martin Lippert, Michael Minella
on Apr 13, 2026
