Apache Content on InfoQ
-
Inside Uber’s Query Architecture: Simplifying Layers and Improving Observability
Uber rebuilt its Apache Pinot query architecture, replacing the Presto-based Neutrino system with a lightweight proxy called Cellar and Pinot’s Multi-Stage Engine Lite Mode. The redesign simplifies SQL execution, improves resource management, and ensures predictable performance for large-scale analytics workloads.
-
IBM Releases Granite-Docling-258M, a Compact Vision-Language Model for Precise Document Conversion
IBM Research has recently introduced Granite-Docling-258M, a new open-source vision-language model (VLM) designed for high-fidelity document-to-text conversion while preserving complex layouts, tables, equations, and lists.
-
How Netflix Powers Audience Insights at Trillion-Row Scale
In a recent blog post, Netflix engineers described how they scaled Muse, the company’s internal application for data-driven creative insights, to handle trillion-row datasets.
-
Google Launches Gemini CLI: Open-Source Terminal AI Agent for Developers
Google has released Gemini CLI, a new open-source AI command-line interface that brings the full capabilities of its Gemini 2.5 Pro model directly into developers’ terminals. Designed for flexibility, transparency, and developer-first workflows, Gemini CLI provides high-performance, natural language AI assistance through a lightweight, locally accessible interface.
-
The Open-Source Version of InfluxDB 3 Reaches GA
Two years after InfluxData released the GA version of its enterprise edition, the open-source version of InfluxDB 3 has also reached general availability. Designed for real-time workloads and ease of operation, the core version leaves out features such as long-term storage optimisations, compaction, high availability (HA), read replicas, and fine-grained access controls.
-
Confluent Cloud Streamlines Real-Time AI with New Flink Capabilities
Confluent Cloud now simplifies real-time AI development with Flink Native Inference, Flink search, and built-in ML functions, streamlining workflows and reducing costs. This unified Flink SQL solution improves security and efficiency by keeping data within the platform, helping developers deploy AI applications quickly.
-
Optimizing Uber's Search Infrastructure: Upgrading to Apache Lucene 9.5
Uber Engineering recently announced an upgrade to their search infrastructure, transitioning from Apache Lucene 8.0 to version 9.5. This upgrade improves Uber's search capabilities, performance and efficiency across their various services.
-
Microsoft Introduces Drasi: Open-Source System for Real-Time Event Processing and Automation
Microsoft’s Azure Incubations team introduced Drasi, an open-source system that simplifies detecting critical events in complex infrastructures. Drasi offers real-time monitoring and automated responses, eliminating the need for manual event handling. With flexible components and integrations, it streamlines change detection across various data sources.
-
Cloud Native Computing Foundation Graduation of CloudEvents: Q&A with Clemens Vasters
Earlier this year, the Cloud Native Computing Foundation (CNCF) announced that CloudEvents had graduated. CloudEvents is a specification designed to expose event metadata in a standardized manner, which helps to ensure interoperability across platforms, services, and systems.
-
Apache Pekko: Simplifying Concurrent Development with the Actor Model
Apache Pekko is an open-source framework designed to simplify the development of concurrent, distributed, resilient, and elastic applications. Leveraging the Actor Model, Pekko offers high-level abstractions for concurrency, allowing developers to focus on business logic rather than low-level implementation details.
-
Grafana Tempo Introduces New Query Language and Support for Apache Parquet
Grafana has released Grafana Tempo 2.0, which introduces the new TraceQL query language and support for the Apache Parquet format. Grafana Tempo is an open-source tracing backend that works with object storage. TraceQL works with the Apache Parquet format to provide faster searches and queries aligned to traces.
-
Apache Linkis Graduated to Apache Top-Level Project
Apache Linkis is a computation middleware that acts as a layer between upper-level applications and underlying engines, such as Apache Spark, Apache Hive, and Apache Flink. It started as an Apache Incubator project in 2021 and graduated to a Top-Level Project in January 2023.
-
Apache Druid 25.0 Delivers Multi-Stage Query Engine and Kubernetes Task Management
Apache Druid is a high-performance real-time datastore, and its latest release, version 25.0, provides many improvements and enhancements. The main new features are: the multi-stage query (MSQ) task engine used for SQL-based ingestion is now production ready, and Kubernetes can be used to launch and manage tasks, eliminating the need for middle managers...
-
Apache ShenYu: Java Responsive API Gateway Announced as Apache Top-Level Project
The Apache Software Foundation (ASF) has announced that Apache ShenYu, an asynchronous, high-performance, and responsive API Gateway for service proxy, protocol conversion, and API governance, has been promoted to a Top-Level Project. It is written in Java, but supports multiple languages such as Python, Go, and .NET.
-
Java News Roundup: Classfile API Draft, Spring Boot, GlassFish, Project Reactor, Micronaut
This week's Java roundup for June 20th, 2022, features news from OpenJDK, JDK 19, JDK 20, Spring point releases, GlassFish 7.0.0-M6, GraalVM Native Build Tools 0.9.12, Micronaut 3.5.2, Quarkus 2.10.0, Project Reactor 2022.0.0-M3, Apache Camel Quarkus 2.10.0, and Apache Tika versions 2.4.1 and 1.28.4.