InfoQ Homepage Database Content on InfoQ

Articles

RSS Feed

Newer Older

AI, ML & Data Engineering

Reducing False Positives in Retrieval-Augmented Generation (RAG) Semantic Caching: a Banking Case Study

In this article, author Elakkiya Daivam discusses why Retrieval Augmented Generation (RAG) and semantic caching techniques are powerful levers for reducing false positives in AI powered applications. She shares the insights from a production-grade evaluation with 1,000 query variations tested across seven bi-encoder models.

Elakkiya Daivam
on Nov 14, 2025
Java

Building a RAG Application with Spring Boot, Spring AI, MongoDB Atlas Vector Search, and OpenAI

The RAG paradigm redefines AI: it combines generative models and business data for accurate, contextualised responses. The article shows how to integrate Spring Boot, Spring AI, MongoDB Atlas and OpenAI into a powerful and flexible pipeline capable of transforming the way businesses access and create value from data, with applications ranging from finance and healthcare to customer service.

Matteo Rossi
on Oct 27, 2025
AI, ML & Data Engineering

InfoQ AI, ML and Data Engineering Trends Report - 2025

This InfoQ Trends Report offers readers a comprehensive overview of emerging trends and technologies in the areas of AI, ML, and Data Engineering. This report summarizes the InfoQ editorial team’s and external guests' view on the current trends in AI and ML technologies and what to look out for in the next 12 months.

Srini Penchikala Savannah Kunovsky Anthony Alford Daniel Dominguez Vinod Goje
on Sep 24, 2025
Architecture & Design

Engineering a Time Series Database Using Open Source: Rebuilding InfluxDB 3 in Apache Arrow and Rust

At times, to evolve your product, you need to rebuild it from scratch. The article provides the story behind the rewrite of InfluxDB from scratch using a different programming language - Rust - and stack - Apache Flight, Data Fusion, Apache Arrow and Parquet (FDAP). It emphasises the benefits, as well as the mechanics behind its operation and the different versions of the product.

Paul Dix
on Sep 10, 2025
Java

Jakarta EE 11 Overview: Virtual Threads, Records, and the Future of Persistence

Jakarta EE 11 delivers enhancements that include support for Java 17 and 21, integration with Java records and virtual threads, and the introduction of the Jakarta Data specification for unified SQL and NoSQL persistence. This release simplifies enterprise Java and establishes the groundwork for Jakarta EE 12, which will advance capabilities in data management.

Otavio Santana
on Jul 29, 2025
Architecture & Design

Optimizing Search Systems: Balancing Speed, Relevance, and Scalability

Innovative software engineer focused on optimizing search performance in dynamic environments. This article highlights key strategies from our QCon San Francisco 2024 presentation, addressing challenges faced by platforms like Uber Eats in data indexing and retrieval. Our advancements ensure swift, relevant user experiences amidst ever-growing datasets.

Janani Narayanan Karthik Ramasamy
on Jul 16, 2025
Architecture & Design

Shadow Table Strategy for Seamless Service Extractions and Data Migrations

The shadow table strategy creates a synchronized duplicate of the data that keeps the production system fully operational during changes, enabling zero-downtime migrations. The approach supports diverse scenarios - including database migrations, microservices extractions, and incremental schema refactoring - that update live systems safely and progressively.

Apoorv Mittal
on Apr 09, 2025
AI, ML & Data Engineering

Bridging Modalities: Multimodal RAG for Advanced Information Retrieval

In this article, the authors discuss how multi-model retrieval augmented generation (RAG) techniques can enhance AI by integrating multiple modalities like text, images, and audio for deeper contextual understanding, with help of a practical example of a healthcare application.

Suruchi Shah Suraj Dharmapuram
on Apr 07, 2025
Development

How to Compute without Looking: a Sneak Peek into Secure Multi-Party Computation

This article shows how you can compute a function across multiple parties that do not trust each other without forcing them to share their individual inputs. This technique can be used to split secrets among parties, perform logical operations, or count votes in a way that ensures data privacy is preserved.

Debasish Ray Chawdhuri
on Mar 31, 2025
Java

Reactive Real-Time Notifications with SSE, Spring Boot, and Redis Pub/Sub

Explore the power of reactive programming for building scalable real-time notification systems. Using Spring Boot Reactive and Spring WebFlux, leverage non-blocking operations to handle high-volume, asynchronous data flows efficiently. Discover how Redis Pub/Sub enables event-driven messaging and how the SSE protocol provides persistent connections for instant client updates without polling.

Matteo Rossi
on Nov 21, 2024
Cloud

Optimizing Wellhub Autocomplete Service Latency: a Multi-Region Architecture

Every company wants fast, reliable, and low-latency services. Achieving these goals requires significant investment and effort. In this article, I will share how Wellhub invested in a multi-region architecture to achieve a low-latency autocomplete service.

Matheus Felisberto
on Oct 17, 2024
Java

Modernizing Testing Practices for Jakarta EE Projects

This article focuses on the increasing adoption of data-driven testing in Java enterprise applications and sheds light on the Data and NoSQL Jakarta specifications. It highlights the significance of modern testing libraries such as JUnit Jupiter and AssertJ and emphasizes the importance of container-based frameworks like Testcontainers in enhancing testing practices.

Otavio Santana
on Apr 10, 2024

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles