InfoQ Homepage Programming Content on InfoQ

Articles

RSS Feed

Newer Older

Java

Building a RAG Application with Spring Boot, Spring AI, MongoDB Atlas Vector Search, and OpenAI

The RAG paradigm redefines AI: it combines generative models and business data for accurate, contextualised responses. The article shows how to integrate Spring Boot, Spring AI, MongoDB Atlas and OpenAI into a powerful and flexible pipeline capable of transforming the way businesses access and create value from data, with applications ranging from finance and healthcare to customer service.

Matteo Rossi
on Oct 27, 2025
Culture & Methods

A Plan-Do-Check-Act Framework for AI Code Generation

AI code generation tools promise faster development but often create quality issues, integration problems, and delivery delays. A structured Plan-Do-Check-Act cycle can maintain code quality while leveraging AI capabilities. Through working agreements, structured prompts, and continuous retrospection, it asserts accountability over code while guiding AI to produce tested, maintainable software.

Ken Judy
on Oct 20, 2025
Development

Go Channels: Understanding Happens-Before for Safe Concurrency

This article dives into the happens-before semantics of Go channels, explaining how they relate to memory visibility, synchronization, and concurrency correctness. We'll examine subtle pitfalls, illustrate them with examples, and explore the architectural implications for system designers.

Gabor Koos
on Oct 13, 2025
Culture & Methods

Exploring the Unintended Consequences of Automation in Software

This article lays out some of the common assumptions and misconceptions about automation and its role in software (and software incidents), what our research has found regarding how automation shows up in software incidents, and some ideas around how people can better design automated tools to help people better handle software incidents.

Courtney Nash
on Oct 10, 2025
Java

Bringing AI Inference to Java with ONNX: a Practical Guide for Enterprise Architects

Java applications can now run transformer-based AI models directly within the JVM—without Python, REST wrappers, or microservices. This guide shows how to integrate ONNX-powered inference with tokenizer support, GPU acceleration, modular deployment, and observability, enabling architects in regulated domains to adopt AI without disrupting compliance or CI/CD workflows.

Syed Danish Ali
on Oct 03, 2025
Architecture & Design

A Pipeline Approach to Language Migrations

Automated language migrations can be made reliable and maintainable by structuring them as pipelines with clear, testable stages. This avoids the pitfalls of big-bang rewrites while providing transparency and modularity. The pipeline approach ensures idiomatic code, preserves legacy business logic, and supports large-scale transformations from outdated systems.

Federico Tomassetti
on Oct 01, 2025
AI, ML & Data Engineering

Disaggregation in Large Language Models: the Next Evolution in AI Infrastructure

Large Language Model (LLM) inference faces a fundamental challenge: the same hardware that excels at processing input prompts struggles with generating responses, and vice versa. Disaggregated serving architectures solve this by separating these distinct computational phases, delivering throughput improvements and better resource utilization while reducing costs.

Anat Heilper
on Sep 29, 2025
AI, ML & Data Engineering

InfoQ AI, ML and Data Engineering Trends Report - 2025

This InfoQ Trends Report offers readers a comprehensive overview of emerging trends and technologies in the areas of AI, ML, and Data Engineering. This report summarizes the InfoQ editorial team’s and external guests' view on the current trends in AI and ML technologies and what to look out for in the next 12 months.

Srini Penchikala Savannah Kunovsky Anthony Alford Daniel Dominguez Vinod Goje
on Sep 24, 2025
Architecture & Design

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust

At times, to evolve your product, you need to rebuild it from scratch. The article provides the story behind the rewrite of InfluxDB from scratch using a different programming language - Rust - and stack - Apache Flight, Data Fusion, Apache Arrow and Parquet (FDAP). It emphasises the benefits, as well as the mechanics behind its operation and the different versions of the product.

Paul Dix
on Sep 10, 2025
Culture & Methods

Virtual Panel: How Software Engineers and Team Leaders Can Excel with Artificial Intelligence

Artificial intelligence is impacting the individual work of software developers, how professionals work together in teams, and how software teams are being managed. In this panel, we'll discuss how artificial intelligence is reshaping software development, and what mindset and skills are required for software developers and engineering leaders to become adaptable and resilient in the age of AI.

Ben Linders Courtney Nash Mandy Gu Hien Luu
on Sep 05, 2025
AI, ML & Data Engineering

Effective Practices for Architecting a RAG Pipeline

Hybrid search, smart chunking, and domain-aware indexing are key to building effective RAG pipelines. Context window limits and prompt quality critically affect LLM response accuracy. This article provides lessons learned from setting up a RAG pipeline.

Glenn Engstrand
on Sep 03, 2025
DevOps

How Causal Reasoning Addresses the Limitations of LLMs in Observability

Large language models excel at converting observability telemetry into clear summaries but struggle with accurate root cause analysis in distributed systems. LLMs often hallucinate explanations and confuse symptoms with causes. This article suggests how causal reasoning models with Bayesian inference offer more reliable incident diagnosis.

Dhairya Dalal
on Sep 02, 2025

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles