InfoQ Homepage Apache Kafka Content on InfoQ
-
Kestra: a Scalable Open-Source Orchestration and Scheduling Platform
Kestra, a new open-source orchestration and scheduling platform, helps developers to build, run, schedule, and monitor complex pipelines. The concept of a workflow, called Flow in Kestra, is at the heart of the platform. It is a list of tasks defined with a descriptive language based on yaml.
-
Real-Time Exactly-Once Event Processing at Uber with Apache Flink, Kafka, and Pinot
Uber faced some challenges after introducing ads on UberEats. The events they generated had to be processed quickly, reliably and accurately. These requirements were fulfilled by a system based on Apache Flink, Kafka, and Pinot that can process streams of ad events in real-time with exactly-once semantics. An article describing its architecture was published recently in the Uber Engineering blog.
-
Data Collection, Standardization and Usage at Scale in the Uber Rider App
Uber Engineering recently published how it collects, standardises and uses data from the Uber Rider app. Rider data comprises all the rider's interactions with the Uber app. This data accounts for billions of events from Uber's online systems every day. Uber uses this data to deal with top problem areas such as increasing funnel conversion, user engagement, etc.
-
Netflix Builds a Reliable, Scalable Platform with Event Sourcing, MQTT and Alpakka-Kafka
Netflix recently published a blog post detailing how it built a reliable device management platform using an MQTT-based event sourcing implementation. To scale its solution, Netflix utilizes Apache Kafka, Alpakka-Kafka and CockroachDB.
-
Microsoft Announces Event Hubs Premium in Preview
Azure Event Hubs is Microsoft’s managed real-time event ingestion service designed to serve demanding big data streaming and event ingestion needs in the Cloud. Microsoft announced the public preview of Event Hubs Premium during the annual Build conference as a new product SKU tailor-made for high-end event streaming scenarios requiring elastic, superior performance with predictable latency.
-
Confluent Announces Confluent for Kubernetes into General Availability
Recently, Confluent announced the general availability (GA) of Confluent for Kubernetes, a complete, declarative API-driven experience for deploying and self-managing Confluent Platform as a cloud-native system. With Confluent for Kubernetes, the company packages its event-streaming platform into Kubernetes and provides a Cloud-Native offering.
-
Airbnb Builds Himeji - a Scalable Centralized Authorization System
Airbnb recently described how it built Himeji, a scalable centralized authorization system. Himeji stores permissions data and performs permission checks as a central source of truth. It uses a sharded and replicated in-memory cache to improve performance and lower latencies and has served checks in production for about a year.
-
Confluent Announces Strategic Alliance with Microsoft
Confluent, the company of the founders of Apache Kafka, recently announced a new strategic alliance between them and Microsoft to enable a more integrated experience between Confluent Cloud and the Azure platform.
-
Uber Implements Disaster Recovery for Multi-Region Kafka
In a recent blog post, Uber engineers highlight how they use a replication platform to implement disaster recovery at scale with a multi-region Kafka deployment. Uber has a large deployment of Apache Kafka, processing trillions of messages and multiple petabytes of data per day. Uber's engineers provided business resilience and continuity in the face of natural and human-made disasters.
-
LinkedIn Migrates away from Lambda Architecture to Reduce Complexity
Software engineers from LinkedIn recently published how they migrated away from a Lambda architecture. The Lambda architecture implementation caused their solution to have high operational overhead and added complexity, leading to slow product iteration times. As a result, the engineers chose to migrate to a Lambda-less architecture, resulting in significant development velocity improvements.
-
Infinite Storage & Retention for Apache Kafka in Confluent Cloud
Confluent, Inc. recently announced the Infinite Storage option for its standard and dedicated clusters. This offering is a part of the Project Metamorphosis initiative, which is focused on enabling Kafka with modern cloud properties. Organizations can have a centralized platform for all event data for real-time actioning and historical analysis with limitless storage and retention.
-
KSQL Now Available on Confluent Cloud
KSQL is the streaming SQL engine for Apache Kafka, and it is currently available as a fully-managed service on the Confluent Cloud Platform for all its customers on usage-based billing plans. In a recent blog post, Confluent announced the availability of Confluent Cloud KSQL.
-
Confluent Offers Apache Kafka as a Service on the Azure Marketplace
In a recent blog post, Confluent announced the general availability of Confluent Cloud on Microsoft Azure. Confluent Cloud is a fully managed Apache Kafka service that removes the burden of operationally managing Kafka for engineers.
-
Experience Using Event Streams, Kafka and the Confluent Platform at Deutsche Bahn
To provide trip information to their rail passengers, Deutsche Bahn (DB) has created the RI-Plattform (Passenger Information Application) based on Apache Kafka and Kafka Streams with a plan to feed all information channels through the system. In a blog post, Axel Löhn and Uwe Eisele describe the microservices based design, how they build and run the system, and their experience from production.
-
Confluent Offers Apache Kafka as a Service on the GCP Marketplace
In a recent blog post, Confluent announced the general availability of Confluent Cloud on the Google Cloud Platform (GCP) Marketplace. Confluent Cloud is a fully managed Apache Kafka service, which removes the burden of its users to manage Kafka themselves.