InfoQ Homepage Change Data Capture Content on InfoQ

News

RSS Feed

Architecture & Design

Pinterest’s CDC-Powered Ingestion Slashes Database Latency from 24 Hours to 15 Minutes

Pinterest launched a next-generation CDC-based database ingestion framework using Kafka, Flink, Spark, and Iceberg. The system reduces data availability latency from 24+ hours to 15 minutes, processes only changed records, supports incremental updates and deletions, and scales to petabyte-level data across thousands of pipelines, optimizing cost and efficiency.

Leela Kumili
on Feb 26, 2026
Architecture & Design

Uber Achieves 150M Reads per Second with CacheFront Improvements

Uber has updated its CacheFront architecture to handle over 150 million reads per second. The new design improves consistency and reduces stale reads by integrating Flux for MySQL binlog tailing, enhancing the storage engine, and introducing Cache Inspector for monitoring and optimization.

Leela Kumili
on Oct 06, 2025
Architecture & Design

Uber's CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency

Uber developed an innovative caching solution, CacheFront, for its in-house distributed database, Docstore. CacheFront enables over 40M reads per second from online storage and achieves substantial performance improvements, including a 75% reduction in P75 latency and over 67% reduction in P99.9 latency, demonstrating its effectiveness in enhancing system efficiency and scalability.

Eran Stiller
on Feb 29, 2024
Architecture & Design

Netflix Creates Incremental Processing Solution Using Maestro and Apache Iceberg

Netflix created a new solution for incremental processing in its data platform. The incremental approach reduces the cost of computing resources and execution time significantly as it avoids processing complete datasets. The company used its Maestro workflow engine and Apache Iceberg to improve data freshness and accuracy and plans to provide managed backfill capabilities.

Rafał Gancarz
on Jan 15, 2024
Architecture & Design

Distributed Materialized Views: How Airbnb’s Riverbed Processes 2.4 Billion Daily Events

Airbnb created Riverbed, a Lambda-like data framework for producing and managing distributed materialized views. The framework supports over 50 read-heavy use cases where data is sourced from multiple data sources within the company’s service-oriented architecture (SOA) platform. It uses Apache Kafka and Apache Spark for online and offline components, respectively.

Rafał Gancarz
on Oct 04, 2023
Architecture & Design

Yelp Rebuilds Corrupted Cassandra Cluster Using Its Data Streaming Architecture

Yelp created a solution to sanitize data from the corrupted Apache Cassandra cluster utilizing its data streaming architecture. The team explored many potential options to address the data corruption issue, however, ultimately had to move the data into a new cluster to remove corrupted records in the process.

Rafał Gancarz
on Jul 17, 2023
Java

Debezium Releases Version 2.0 of Its Change Data Capture Tool

Debezium, an open-source distributed platform for change data capture (CDC), converts records from existing databases into event streams, enabling applications to detect and respond to database row-level changes. This release of version 2.0 introduces many changes: Java 11 is now required; incremental snapshots are improved [...]

Andrea Messetti
on Nov 09, 2022
Architecture & Design

Netflix Studio Search: Using Elasticsearch and Apache Flink to Index Federated GraphQL Data

Netflix engineers recently published how they built Studio Search, using Apache Kafka streams, an Apache Flink-based Data Mesh process, and Elasticsearch to manage the index. They designed the platform to take a portion of Netflix's federated GraphQL graph and make it searchable. Today, Studio Search powers a significant portion of the user experience for many applications within the organisation.

Eran Stiller
on Apr 19, 2022
Architecture & Design

Uber Re-Architected Its Foundational Fulfilment Service

Uber recently shared how it re-architected its fulfilment service, one of Uber's foundational platform services. Following a two-year-long effort involving 30+ teams and hundreds of developers, Uber engineers "built a strong foundation for modelling various types of physical fulfilment categories in the new platform and migrated all existing transportation use cases."

Eran Stiller
on Aug 10, 2021

Unlock the full InfoQ experience

Don't have an InfoQ account?

Topics

Million PDFs: Building a Modern Document Infrastructure with Rust and Typst

Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix

Graph RAG: Building Smarter Retrieval Workflows with Knowledge Graphs

Craig McLuckie on Culture as a Team's Operating System in the AI Era

The Time it Wasn't DNS

Helpful links

Choose your language

News

Pinterest’s CDC-Powered Ingestion Slashes Database Latency from 24 Hours to 15 Minutes

Uber Achieves 150M Reads per Second with CacheFront Improvements

Uber's CacheFront: Powering 40M Reads per Second with Significantly Reduced Latency

Netflix Creates Incremental Processing Solution Using Maestro and Apache Iceberg

Distributed Materialized Views: How Airbnb’s Riverbed Processes 2.4 Billion Daily Events

Yelp Rebuilds Corrupted Cassandra Cluster Using Its Data Streaming Architecture

Debezium Releases Version 2.0 of Its Change Data Capture Tool

Netflix Studio Search: Using Elasticsearch and Apache Flink to Index Federated GraphQL Data

Uber Re-Architected Its Foundational Fulfilment Service

Million PDFs: Building a Modern Document Infrastructure with Rust and Typst

Rust at the Core - Accelerating Polyglot SDK Development

Anthropic Lead: HTML Increasingly Better Than Markdown at Keeping Humans Engaged in Agentic Loops

Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix

Instacart Scales Personalized Marketing via Configuration-Driven Multi-Tenant Platform

Inside Target’s LLM-Based System for Semantic Matching in Marketing Forecast Pipelines

Shifting Platform Development from Projects to Products

Building a European Cloud Orchestration Platform within an Enterprise

How Lightweight ADRs and Architectural Advice Forums Can Support Architectural Decisions

Graph RAG: Building Smarter Retrieval Workflows with Knowledge Graphs

The Infrastructure Challenge Behind Production AI

Trustworthy Productivity: Securing AI-Accelerated Development

Microsoft Brings AI-Powered Vulnerability Remediation to Azure DevOps with Copilot Autofix

AI Tools Accelerates Coding, But Not Overall Software Delivery, GitLab Research Finds

AWS Introduces Workload Credentials Provider for Automated Certificate and Secret Management

Online InfoQ AI Engineering Certification

Online InfoQ Architect Certification

Online InfoQ AI Security & Privacy Engineering Program

QCon San Francisco

QCon London 2027

InfoQ Software Architects' Newsletter

News