InfoQ Homepage SQL Content on InfoQ
-
DuckLake 1.0: Data Lake Format with SQL Catalog Metadata
DuckDB Labs recently released DuckLake 1.0, a data lake format that stores table metadata in a SQL database rather than across many files in object storage. The first implementation is available as a DuckDB extension and includes catalog-stored small updates, improved sorting and partitioning options, and compatibility with Iceberg-style data features.
-
Google Cloud Highlights Ongoing Work on PostgreSQL Core Capabilities
Google Cloud has outlined its recent technical contributions to PostgreSQL, emphasizing improvements in logical replication, upgrade processes, and overall system stability. The update reflects ongoing collaboration with the upstream community and focuses on enhancements to the core engine aimed at addressing scalability, replication, and operational challenges.
-
OpenAI Scales Single Primary PostgreSQL Instance to Millions of Queries per Second for ChatGPT
OpenAI described how it scaled PostgreSQL to support ChatGPT and its API platform, handling millions of queries per second for hundreds of millions of users. By running a single-primary PostgreSQL deployment on Azure with nearly 50 read replicas, optimizing query patterns, and offloading write-heavy workloads to sharded systems, OpenAI maintained low-latency reads while managing write pressure.
-
Prisma 7: Rust-Free Architecture and Performance Gains
Prisma ORM 7.0 has revolutionized the TypeScript-first ORM landscape with a Rust-free architecture, delivering 3x faster queries, 90% smaller bundles, and improved developer experience. With dynamic configurations and streamlined artifact management, Prisma enhances productivity while supporting major databases. Elevate your Node.js projects with cutting-edge performance and type safety.
-
Cloudflare Introduces Aggregations in R2 SQL for Data Analytics
Cloudflare recently announced support for aggregations in R2 SQL, a new feature that lets developers run SQL queries on data stored in R2. This enhancement expands R2 SQL beyond basic filtering and makes it more useful for analytical workloads without requiring separate data warehouse tools.
-
Memori Expands into a Full-Scale Memory Layer for AI Agents across SQL and MongoDB
Memori is an innovative, open-source memory system that empowers AI agents with structured, long-term memory using standard databases like SQL and MongoDB. It seamlessly integrates into existing frameworks, enabling efficient data extraction and retrieval without vendor lock-in. Ideal for developers, Memori's modular design ensures reliability and scalability for next-gen intelligent systems.
-
Inside Uber’s Query Architecture: Simplifying Layers and Improving Observability
Uber rebuilt its Apache Pinot query architecture, replacing the Presto-based Neutrino system with a lightweight proxy called Cellar and Pinot’s Multi-Stage Engine Lite Mode. The redesign simplifies SQL execution, improves resource management, and ensures predictable performance for large-scale analytics workloads.
-
Agoda Leverages ChatGPT in the CI/CD Process for SQL Stored Procedure Optimization
Agoda started utilizing ChatGPT to optimize SQL stored procedures (SP) as part of their CI/CD process. After introducing the automated LLM-assisted step, the company observed shortened stored procedure optimization times, which lightened the load on DB developers. Agora works on making ChatGPT more accessible for SP optimization outside of the CI/CD pipeline.
-
Neon Serverless Postgres Now Generally Available as an Azure Native Integration
Microsoft and Neon launch Neon Serverless Postgres, now generally available as a native Azure integration, offering developers an innovative, scalable, and cost-effective database solution. With automatic scaling, instant provisioning, and seamless integration, it's designed for enterprises and AI startups alike, enhancing workflows and enabling efficient database management within Azure.
-
Cloudflare Upgrades D1 Database with Global Read Replication
During the recent Developer Week 2025, Cloudflare announced the beta of global read replication for its serverless SQL database D1, providing a globally distributed option without sacrificing consistency. With automatically provisioned replicas in every region, applications can now serve read queries faster while maintaining strong sequential consistency across requests.
-
Google Cloud Introduces HDD Tier for Spanner Database, Cutting Cold Storage Costs by 80%
Google has recently introduced tiered storage for Spanner, its distributed SQL database on Google Cloud. This tiered storage is based on a new HDD storage option that is 80% cheaper than the existing SSD option, allowing for cost optimization of older data while minimizing the overhead associated with traditional data migration.
-
Google Cloud Launches Gen AI Toolbox for Databases
Google Cloud has announced the public beta launch of Gen AI Toolbox for Databases, an open-source server developed in collaboration with LangChain. This new tool is designed to help developers seamlessly integrate production-grade, agent-based generative AI applications with databases while ensuring secure access, scalability, and observability.
-
Microsoft Announces General Availability of Fabric API for GraphQL
Microsoft has launched Fabric API for GraphQL, moving the data access layer from public preview to general availability (GA). This release introduces several enhancements, including support for Azure SQL and Fabric SQL databases, saved credential authentication, detailed monitoring tools, and integration with CI/CD workflows.
-
Timescale Bolsters AI-Ready PostgreSQL with pgai Vectorizer
Timescale recently expanded its PostgreSQL AI offerings with pgai Vectorizer. This update enables developers to create, store, and manage vector embeddings alongside relational data without the need for external tools or additional infrastructure.
-
How Uber Sped up SQL-based Data Analytics with Presto and Express Queries
Uber uses Presto, an open-source distributed SQL query engine, to provide analytics across several data sources, including Apache Hive, Apache Pinot, MySQL, and Apache Kafka. To improve its performance, Uber engineers explored the advantages of dealing with quick queries, a.k.a. express queries, in a specific way and found they could improve both Presto utilization and response times.