Database Content on InfoQ
-
Netflix Tackles Data Deletion at Scale with Centralized Platform Architecture
Netflix engineers presented their architecture for a centralized data-deletion platform at QCon San Francisco, addressing a critical yet rarely discussed system design challenge. The platform manages deletion across heterogeneous data stores while balancing durability, availability, and correctness, processing 76.8 billion row deletions across 1,300 datasets with zero data loss incidents.
-
Parting the Clouds: the Rise of Disaggregated Systems by Murat Demirbas at QCon SF 2025
Cloud computing is evolving through disaggregation, addressing inefficiencies of traditional architectures by decoupling compute and storage. This shift enhances scalability, fault isolation, and operational simplicity, driven by advancements in networking. As seen in cloud databases such as Amazon Aurora, embracing these principles enables true economic optimization and innovative design.
-
Inside Uber’s Query Architecture: Simplifying Layers and Improving Observability
Uber rebuilt its Apache Pinot query architecture, replacing the Presto-based Neutrino system with a lightweight proxy called Cellar and Pinot’s Multi-Stage Engine Lite Mode. The redesign simplifies SQL execution, improves resource management, and ensures predictable performance for large-scale analytics workloads.
-
Amazon Timestream for InfluxDB Adds Support for InfluxDB 3 Core and Enterprise
InfluxData has launched InfluxDB 3 Core and Enterprise on Amazon Timestream, offering a high-speed, open-source time-series database for real-time applications. With enhanced security, scalability, and performance, developers can seamlessly integrate with AWS services. InfluxDB 3 redefines data management for AI-driven environments, enabling rapid analytics and decision-making.
-
Airbnb’s Mussel V2: Next-Gen Key Value Storage to Unify Streaming and Bulk Ingestion
Airbnb’s engineering team re-architected its internal key-value storage system, Mussel, to unify streaming and bulk ingestion while simplifying operations. The new version achieves over 100,000 writes per second and sub-25ms read latencies on 100-terabyte tables. It builds on Kubernetes, Kafka, and a NewSQL backend to improve scalability, reliability, and operational efficiency across Airbnb’s internal services.
-
Apollo GraphQL Client 4.0 Released with Leaner Bundles and Strengthened TypeScript Safety
Apollo GraphQL has launched Apollo Client 4.0, featuring a streamlined architecture, enhanced TypeScript support, and modular APIs to reduce bundle sizes by 20-30%. Key updates include opt-in features, improved error handling, and a decoupled core library, making it easier for developers to manage GraphQL operations across various frameworks. Feedback has been overwhelmingly positive.
-
How Netflix Powers Audience Insights at Trillion-Row Scale
In a recent blog post, Netflix engineers described how they scaled Muse, the company’s internal application for data-driven creative insights, to handle trillion-row datasets.
-
Google Spanner Unifies OLTP and OLAP with Columnar Engine
Google Spanner now features a columnar engine, allowing its distributed database to handle both OLTP and OLAP workloads on a single platform. This hybrid architecture eliminates the need for separate data warehouses and ETL pipelines. The engine's columnar storage and vectorized execution accelerate analytical queries up to 200x on live data, which is especially beneficial for AI applications.
-
TanStack DB Enters Beta with Reactive Queries, Optimistic Mutations, and Local-First Sync
TanStack DB, an open-source embedded client-side database for frontend development, has entered beta. It offers reactive queries, typed collections, and optimistic mutations, aiming to simplify state management and keep updates fast. It integrates with existing TanStack Query applications.
-
Amazon DocumentDB Serverless: Auto-Scaling Database Solution for Variable Workloads
AWS has launched Amazon DocumentDB Serverless, a MongoDB-compatible database that scales automatically for variable workloads. Although marketed as "serverless," it functions more like auto-scaling, with charges starting from $30/month. It targets enterprises and SaaS vendors that need to handle spikes in demand, particularly for AI-driven applications.
-
Yearly MariaDB LTS Release Integrates Vector Search
MariaDB has recently released MariaDB Community Server 11.8 as generally available, its yearly long-term support (LTS) release for 2025. The new release introduces integrated vector search capabilities for AI-driven and similarity search applications, enhanced JSON functionality, and temporal tables for data history and auditing.
-
HTAP: the Rise and Fall of Unified Database Systems?
A recent article by Zhou Sun sparked a debate in the data community about the future of HTAP systems. Hybrid transaction/analytical processing was meant to help integrate historical and online data at scale, supporting more flexible query methods and reducing business complexity.
-
Amazon Aurora DSQL Goes GA: Distributed, PostgreSQL-Compatible Serverless Database
Amazon has recently announced the general availability of Amazon Aurora DSQL, a PostgreSQL-compatible, serverless, and distributed database. The new managed service is designed to support active-active high availability and multi-region strong consistency.
-
QCon London 2025: How to Build a Database without a Server
Alex Seaton, staff engineer at Man Group, presented “How to Build a Database Without a Server” at QCon London 2025. Seaton demonstrated how they migrated an older hedge fund trading system application, which ran on a cluster farm that was difficult to maintain, to an application using a serverless database and Conflict-Free Replicated Data Types (CRDTs).
-
Cloudflare Upgrades D1 Database with Global Read Replication
During the recent Developer Week 2025, Cloudflare announced the beta of global read replication for its serverless SQL database D1, providing a globally distributed option without sacrificing consistency. With automatically provisioned replicas in every region, applications can now serve read queries faster while maintaining strong sequential consistency across requests.