Apache Kafka Reaches 1.0

The Apache Software Foundation has announced Apache Kafka 1.0, adding an improved Streams API, enhanced metrics, improved tolerance for disk failures, general bug fixes, and more.

Apache Kafka is an open distributed streaming platform, used by thousands of companies worldwide. Some enhancements in this release include:

Various improvements to the Streams API. These include a new API to expose the state of active tasks at runtime, an improved builder API, a new cogroup API, and improved debuggability
A wide number of improvements to metrics, such a new health checking and a global topic and partition count
Java 9 support, bringing faster TLS and CRC32 implementations. This leads to faster over-the-wire encryption
Improved authentication error handling
Better tolerance for disk failures. Now a single disk failure in a JBOD broker will no longer bring the entire broker down

Despite already being in widespread use, this is the first major release milestone for Apache Kafka. Neha Narkhede, co-creator of Kafka, explains why:

For Apache Kafka, the wait for 1.0 was less about stability and more about completeness of the vision that we and the community set to build towards back when we first created Kafka. After all, Kafka has been in production at thousands of companies for several years.

Specifically, Narkhede outlines this vision:

So that is the vision we had in mind and what we set out to build towards – a Streaming Platform; the ability to read, write, move and process streams of data with transactional correctness at company-wide scale.

Narkhede also explains the iterations that Kafka has gone through in order to achieve this vision. These have included:

Introducing a log like abstraction for continuous streams, where publishing is appending to an ordered log, and consuming is reading continuously from a given offset.
Adding replication and fault tolerance to logs
Introduction of Connect and Streams APIs used to make it easy to get data out of Kafka and process it
Exactly-once semantics for stream processing through transactions

Apache Kafka is available for download, and the full blog post from Narkhede is available to read online, outlining the full Kafka journey from conception to now.

Topics

Pitfalls of Unified Memory Models in GPUs

Evolving Trainline Architecture for Scale, Reliability and Productivity

Generally AI - Season 2 - Episode 3: Surviving the AI Winter

Mastering Observability: Unlocking Customer Insights with Gojko Adzic

Proactive Approaches to Securing Linux Systems and Engineering Applications

Helpful links

Choose your language

Write for InfoQ

Rate this Article

This content is in the AI, ML & Data Engineering topic

Related Topics:

Related Editorial

Related Sponsored Content

Popular across InfoQ

Microsoft Introduces Drasi: Open-Source System for Real-Time Event Processing and Automation

How Cell-Based Architecture Enhances Modern Distributed Systems

Article Series: Cell-Based Architectures: How to Build Scalable and Resilient Systems

Orchestrating a Path to Success - a Conversation with Bernd Ruecker

OpenAI Releases Swarm, an Experimental Open-Source Framework for Multi-Agent Orchestration

Generally AI - Season 2 - Episode 3: Surviving the AI Winter

Challenges and Lessons Porting Code from C to Rust

Copilot Now Available in OneDrive: AI-Powered Features for Streamlined Document Management

Ephemeral IDs: Cloudflare's Latest Tool for Fraud Detection

Evolving Trainline Architecture for Scale, Reliability and Productivity

Taking Advantage of Cell-Based Architectures to Build Resilient and Fault-Tolerant Systems

No EC2 or Kubernetes Allowed: Insights from Building Serverless-Only Architecture at PostNL

Mastering Observability: Unlocking Customer Insights with Gojko Adzic

How a Sustainable Mindset in Software Engineering Can Increase Team Performance and Prevent Burnout

The Ongoing Challenges of DevSecOps Transformation and Improving Developer Experience

University Researchers Publish Analysis of Chain-of-Thought Reasoning in LLMs

Microsoft and Tsinghua University Present DIFF Transformer for LLMs

OpenAI Releases Swarm, an Experimental Open-Source Framework for Multi-Agent Orchestration

Google Cloud Adds Scalable Vector Search to Memorystore for Valkey & Redis Cluster

Podman Desktop 1.13 Launches with Hyper-V Support and Additional Enhancements

Uber Completes Major MySQL Fleet Upgrade, Boosting Performance and Security

QCon San Francisco

QCon London

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?