Database Content on InfoQ

News

AI, ML & Data Engineering

Version Controlled SQL Database Dolt Releases 2.0 with Automatic Storage Cleanup and Compression

DoltHub has recently released Dolt 2.0, a major update to the open source version-controlled SQL database. The latest major version adds automatic storage optimization, including garbage collection and compression, along with improved support for large and vector data types.

Renato Losio
on Jul 18, 2026

Cloud

AlloyDB Ships Proxy Models That Replace LLM Calls with Local Inference inside the Database

Google has made AlloyDB AI functions generally available, with a proxy model architecture that trains a lightweight local model from LLM outputs and then runs queries at database speed without external calls. Smart batching delivers a 2,400x throughput improvement. The proxy model reaches 100,000 rows per second in preview, but the benchmark numbers apply only to ai.if in internal testing.

Steef-Jan Wiggers
on Jul 09, 2026

Architecture & Design

Netflix Cuts Cassandra Read Latency from Seconds to Milliseconds with Dynamic Partition Splitting

Netflix engineers introduced dynamic partition splitting for Cassandra to address wide partitions in time series workloads. The metadata-driven approach detects oversized partitions, splits them into smaller units, and routes reads across the child partitions. Netflix reported that read latency dropped from seconds to milliseconds, with fewer timeouts and improved cluster stability, while maintaining transparency.

Leela Kumili
on Jul 06, 2026

AI, ML & Data Engineering

AWS Open Sources ExtendDB to Run DynamoDB on PostgreSQL

AWS recently announced ExtendDB, a DynamoDB-compatible adapter that lets developers use the DynamoDB API with different storage backends, starting with PostgreSQL. The project supports existing SDKs and tools without modification, giving teams greater flexibility to run DynamoDB-style workloads outside of native DynamoDB while maintaining compatibility with current applications and workflows.

Renato Losio
on Jun 07, 2026

AI, ML & Data Engineering

Cloudflare Identifies Query Planning Bottleneck in ClickHouse

Cloudflare recently described how a slowdown in its billing pipeline was traced to contention inside the query planning stage of ClickHouse. The team profiled the bottleneck and patched ClickHouse to replace an exclusive lock with a shared lock, drop the per-query copy of the parts list, and improve part filtering.

Renato Losio
on Jun 06, 2026

Web Development

TypeORM Reaches 1.0 after Nearly a Decade, Signalling Renewed Maintenance

TypeORM 1.0 is the first major release of the open-source TypeScript and JavaScript ORM since its inception in 2016. This version modernizes platform requirements, removes deprecated APIs, and introduces numerous bug fixes and new features. TypeORM now supports ECMAScript 2023, dropping older Node.js versions and dependencies while enhancing security and migration processes.

Daniel Curtis
on Jun 05, 2026

DevOps

Discord Rebuilds Database Operations around Automation to Manage ScyllaDB at Massive Scale

Discord has detailed how it rebuilt its database operations around a new internal orchestration framework called the Scylla Control Plane (SCP), enabling its small infrastructure team to automate large-scale ScyllaDB cluster management tasks that previously took days of manual work.

Craig Risi
on May 22, 2026

DevOps

DBmaestro MCP Server Puts Natural Language in Control of Database Pipelines

DBmaestro has launched an MCP server that connects AI agents and enterprise copilots to its database DevOps platform, allowing teams to issue natural language commands that trigger real, governed platform workflows. The MCP server, announced on 7 April 2026, allows DBAs to expose DBmaestro's release automation, source control, CI/CD orchestration, and compliance capabilities through MCP.

Matt Saunders
on Apr 30, 2026

Architecture & Design

Uber’s Hive Federation Decentralizes 16K Datasets and 10+ PB for Zero-Downtime Analytics at Scale

Uber has decentralized its Hive data warehouse, migrating 16,000 datasets totaling over 10 petabytes using pointer-based federation. The migration ensures zero downtime, strict ACL enforcement, improved governance, and scalable, domain-specific datasets for analytics and machine learning workloads.

Leela Kumili
on Apr 09, 2026

Architecture & Design

Cloudflare and ETH Zurich Outline Approaches for AI-Driven Cache Optimization

Cloudflare and ETH Zurich highlight how AI-driven crawler traffic challenges traditional caching in CDNs and databases. They propose AI-aware strategies including separate cache tiers, adaptive algorithms, and pay-per-crawl models to balance performance for human users and AI services while maintaining cache efficiency and system stability.

Leela Kumili
on Apr 08, 2026

AI, ML & Data Engineering

TigerFS Mounts PostgreSQL Databases as a Filesystem for Developers and AI Agents

TigerFS is a new experimental filesystem that mounts a database as a directory and stores files directly in PostgreSQL. The open source project exposes database data through a standard filesystem interface, allowing developers and AI agents to interact with it using common Unix tools such as ls, cat, find, and grep, rather than via APIs or SDKs.

Renato Losio
on Apr 04, 2026

AI, ML & Data Engineering

ProxySQL Introduces Multi-Tier Release Strategy with Stable, Innovative, and AI Tracks

ProxySQL 3.0.6 was recently released, along with a new multi-tier release strategy. The Stable Tier focuses on reliability and production use, the Innovative Tier introduces newer features earlier, and the AI/MCP Tier explores future capabilities, including AI integrations.

Renato Losio
on Mar 29, 2026

Architecture & Design

Uber Launches IngestionNext: Streaming-First Data Lake Cuts Latency and Compute by 25%

Uber has launched IngestionNext, a streaming-first data lake ingestion platform that reduces data latency from hours to minutes and cuts compute usage by 25%. Built on Kafka, Flink, and Apache Hudi, it supports thousands of datasets, enabling faster analytics, experimentation, and machine learning workloads globally.

Leela Kumili
on Mar 25, 2026

Cloud

AWS Expands Aurora DSQL with Playground, New Tool Integrations, and Driver Connectors

Amazon has announced several updates for Aurora DSQL, focusing on usability, integrations, and developer tooling. The improvements include a new interactive Aurora DSQL Playground that lets developers explore and experiment with the database directly in the browser, without registration or associated costs.

Renato Losio
on Mar 22, 2026

Cloud

QCon London 2026: How to Run on Three Clouds at Once, and When Not to

Form3 runs UK bank payments across three clouds simultaneously. At QCon London, their engineers explained how they built their custom Kubernetes operators, cross-cloud DNS tricks, and distributed databases, and what happened when they tried to sell them in America. Spoiler: US customers wanted East/West failover, not triple-active multi-cloud.

Steef-Jan Wiggers
on Mar 16, 2026
