Lambda architecture has been a popular solution that combines batch and stream processing. Kartik Paramasivam at LinkedIn wrote about how his team addressed stream processing and Lambda architecture challenges using Apache Samza for data processing. The challenges described are the late arrival of events and the processing of duplicated messages.
The NetBeans Community blog has announced that Oracle is proposing to entrust the development of the NetBeans platform and IDE to the Apache Software Foundation to “open up the governance model,” reaffirming its commitment to the project.
TinkerPop, a graph computing framework for OLTP graph databases and OLAP graph analytics, has graduated to a top-level project at the Apache Software Foundation.
Two years after the first release of Apache Spark, Databricks announced the technical preview of Apache Spark 2.0, based on the upstream branch 2.0.0-preview. The preview is not ready for production in terms of either stability or API; it is intended to gather feedback from the community ahead of general availability.
Airflow recently joined the Apache Incubator program. Airflow is a workflow and scheduling system designed to manage data pipelines. Developed by Airbnb for its internal use, it was open sourced last September, as previously reported by InfoQ.
Late last month Google released an alpha version of its TensorFlow (TF) integrated cloud machine learning service in response to a growing need to run the TensorFlow library at scale on the Google Cloud Platform (GCP). Google describes several new feature sets that help TF usage scale by integrating pieces of GCP such as Dataproc, a managed Hadoop and Spark service.
We distill here some of the reactions around the web regarding Google’s adoption of OpenJDK for future Android versions.
A recent security analysis by Foxglove Security suggests that applications using deserialization may be vulnerable to a zero-day exploit. Affected software includes OpenJDK, Apache Commons, Spring and Groovy. InfoQ investigates.
As recently published in InfoQ, the Apache Software Foundation announced the end of life of version 1 of Log4j, encouraging users to upgrade to version 2 of the popular logging framework. InfoQ reached out to the members of the Apache Logging Services Team to find out more about the transition to the new version of Log4j and about its future.
Basho Data Platform supports integration with NoSQL databases like Redis, in-memory analytics, caching, and search. Basho Technologies, the company behind the Riak NoSQL database, announced in May the availability of the data platform, which can be used to deploy and manage Big Data, IoT and hybrid cloud applications.
On June 12, 2015, the Jetty Project released version 9.3 of their flagship open source embedded application server, that day being the 20th anniversary of the project's beginning. Features of the release include HTTP/2 server (and client) support, Java 8 as a minimum, more Java NIO integration and an overhauled scheduler. They also removed SPDY networking support and fixed over 400 bugs.
Stripe, the internet payments infrastructure company, recently announced that it is open sourcing a set of internally developed tools based on Apache Hadoop. Timberlake, Brushfire, Sequins and Herringbone all enrich the set of tools available for building an Apache Hadoop stack.
GridGain's In-Memory Data Fabric entered Apache Incubator last October under the name of Apache Ignite. The company donated its flagship in-memory computing platform to the Apache Software Foundation with the intention of attracting external developers and growing a viable community around its core technology.
At the recent GOTO conference in Berlin, Mahout committer Sebastian Schelter outlined recent advances in Mahout's ongoing effort to create a scalable foundation for data analysis that is as easy to use as R or Python.
Clocker, an open source project, enables users to manage and auto-scale Docker containers in a cloud-agnostic manner. The project is built on top of Apache Brooklyn, multi-cloud application management software.