Confluent Platform 3.0 Supports Kafka Streams for Real-Time Data Processing

by Srini Penchikala on  Jun 03, 2016 2

Confluent Platform 3.0 messaging system from Confluent, the company behind Apache Kafka messaging framework, supports Kafka Streams for real-time data processing. The company announced last week the general availability of the latest version of the open source Confluent platform.

Cloudera Announces Partnership with the Broad Institute

by Dylan Raithel on  Jun 02, 2016

Cloudera announced their partnership with MIT & Harvard's Broad Institute and detailed some of their experience with the Genome Analytics Toolkit pipeline.

Apache Spark 2.0 Technical Preview

by Alex Giamas on  May 31, 2016

Two years after the first release of Apache Spark, Databricks announced the technical preview of Apache Spark 2.0 , based on upstream branch 2.0.0-preview. The preview is not ready for production, neither in terms of stability nor API, but is a release intended to gather feedback from the community ahead of the general availability of the release.

Amazon Releases Kinesis Service Update

by Kent Weare on  May 23, 2016

Amazon has recently announced an update to their Amazon Kinesis Service. In this update, three new features have been added to Amazon Kinesis Streams and Amazon Kinesis Firehose including support for Elasticsearch Service Integration, Shard-Level Metrics and Time-Based Iterators.

Precision Medicine Modeling Demonstration with Spark on EMR, ADAM, and the 1000 Genomes Project

by Dylan Raithel on  May 19, 2016

AWS engineers Christopher Crosbie and Ujjwal Ratan detail using Spark on EMR for precision medicine data analysis on the ADAM platform with data from the 1000 genomes project. - Container Platform for Stateful Applications

by Hrishikesh Barua on  May 16, 2016

Supergiant is a container hosting platform built using Kubernetes for distributed, stateful applications.

DevOps Days Kiel Day 1

by Manuel Pais on  May 15, 2016

Summary of DevOps Days Kiel day 1 talks.

The Broad Institute Migrates Genome Sequencing Pipeline to Google Cloud Platform

by Dylan Raithel on  May 13, 2016

Genomic data sequencing and subsequent analysis faces large data volume challenges that several organizations are solving with cloud services. The Broad Institute detailed their experience with petabyte scale sequencing pipelines last month through the Google Research Blog and is detailed here by InfoQ.

Deep Mind Discloses Details to InfoQ about NHS Partnership amid Reports of Vast Patient Data Access

by Dylan Raithel on  May 05, 2016 1

After months of awaiting details about the NHS and Google DeepMind partnership InfoQ gains insights into recent claims of widespread patient data access.

Elephant in the Cloud - Hadoop as a Service

by Srini Penchikala on  May 02, 2016 2

Hadoop and other big data technologies revolutionized the way organizations run data analytics but the organizations are still facing challenges with operating costs of using these technologies for on-premise data processing. Ashish Thusoo recently spoke at Enterprise Data World Conference about Hadoop as a service offering that helps organizations bridge the gaps with these capabilities.

Ehcache 3.0 Released with Revamped API and Off-Heap Storage

by Matt Raible on  May 02, 2016

Terracotta has released version 3 of their distributed caching technology Ehcache, sporting a number of important new features. First, its API has been refactored and now leverages Java generics. Performance has generally been enhanced, and support for the javax.cache API (JSR-107) and off heap storage capabilities have been added.

Microsoft Releases BizTalk Server 2016 CTP 1

by Kent Weare on  May 01, 2016

On March 30th, 2016 Microsoft announced the release of their BizTalk Server 2016 Community Technical Preview 1 (CTP). This release is one of Microsoft’s milestones they highlighted in their recent Integration Roadmap. In addition to the BizTalk Server CTP, Microsoft has also released an initial CTP for its Host Integration Server offering.

AirFlow Joins Apache Incubator

by Alex Giamas on  Apr 29, 2016

AirFlow recently joined the Apache Incubator program. AirFlow is a workflow and scheduling system designed to manage data pipelines. Developed by AirBnb for their internal usage, it was open sourced last September, as previously reported by InfoQ.

Operational Data Stream and Batch Processing at Netflix with Mantis

by Dylan Raithel on  Apr 27, 2016

Operational Data Stream and Batch Processing at Netflix with Mantis

Neo4j 3.0 Released with Binary Communication Protocol and Standardised Drivers

by Alex Blewitt on  Apr 26, 2016

Today at GraphConnect Europe 2016, Neo Technology announced the release of Neo4j 3.0, which includes a new binary protocol for transmitting data between server and client, and a new set of standardised drivers for interacting with the database, along with stored procedure support and higher performance and capacity. InfoQ spoke to Neo Technology to find out more.

General Feedback
Marketing and all content copyright © 2006-2016 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.
Privacy policy

We notice you're using an ad blocker

We understand why you use ad blockers. However to keep InfoQ free we need your support. InfoQ will not provide your data to third parties without individual opt-in consent. We only work with advertisers relevant to our readers. Please consider whitelisting us.