BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

Data Science Follow 265 Followers

Reactive Summit 2016 Conference: Reactive Microservices and Staging Data Pipelines

by Srini Penchikala Follow 13 Followers on  Oct 08, 2016

Reactive microservices, data center scale operating system (DCOS), and staging reactive data pipelines were the highlighted topics at Reactive Summit 2016 Conference held this week. InfoQ team attended the conference and this post is a summary of the first day's events at the conference.

Data Science Follow 265 Followers

Confluent Announces Kafka for the Enterprise with Multi-Datacenter Replication

by Srini Penchikala Follow 13 Followers on  Oct 05, 2016

Confluent Enterprise latest version supports multi-datacenter replication, automatic data balancing, and cloud migration capability. Confluent, provider of the Apache Kafka based streaming platform, announced last week the new features for Confluent Enterprise, to help build streaming data pipelines and develop stream processing applications.

Cloud Follow 55 Followers

Amazon Kinesis Analytics is Like SaaS for Big Data Analysis

by Elton Stoneman Follow 0 Followers on  Sep 14, 2016

Real-time analysis of event streams has a new focus in Big Data platforms, both on-premise and in the cloud. AWS have released Amazon Kinesis Analytics, a rival to Azure StreamAnalytics. Both platforms use a simple SQL language for complex querying, and move Big Data analysis into a SaaS-like space.

Data Science Follow 265 Followers

IBM Creates Artificial Neurons from Phase Change Memory for Cognitive Computing

by Srini Penchikala Follow 13 Followers on  Sep 13, 2016

A team of scientists at IBM Research in Zurich, have created an artificial version of neurons using phase-change materials to store and process data. These phase change based artificial neurons can be used to detect patterns and discover correlations in Big Data (real-time streams of event based data) and unsupervised machine learning at high speeds using very little energy.

Culture & Methods Follow 173 Followers

Getting the Data Needed for Data Science

by Ben Linders Follow 8 Followers on  Sep 02, 2016

Data science is about the data that you need; deciding which data to collect, create, or keep is fundamental argues Lukas Vermeer, an experienced Data Science professional and Product Owner for Experimentation at Booking.com. True innovation starts with asking big questions, then it becomes apparent which data is needed to find the answers you seek.

Cloud Follow 55 Followers

Azure Premium Messaging Service Reaches General Availability

by Kent Weare Follow 7 Followers on  Jul 31, 2016

On July 15th, Microsoft announced the Azure Premium Messaging service has reached General Availability (GA). Premium Messaging targets customers who would like more predictable messaging performance. InfoQ reached out to Dan Rosanova, Principal Program Manager on the Azure Service Bus team for additional insight into this milestone.

Development Follow 141 Followers

Basho Open Sources Time Series Database Riak TS 1.3

by Rags Srinivas Follow 2 Followers on  Jul 15, 2016

InfoQ's Rags Srinivas talks to Basho's CTO Dave McCrory about the open sourcing of Riak TS 1.3 which is geared to handle time series data.

Data Science Follow 265 Followers

Meson Workflow Orchestration and Scheduling Framework for Netflix Recommendations

by Srini Penchikala Follow 13 Followers on  Jul 10, 2016

Netflix's goal is to predict what you want to watch before you watch it. They do this by running a number of machine learning (ML) workflows every day. Meson is a workflow orchestration and scheduling framework that manages the lifecycle of all these machine learning pipelines that build, train and validate personalization algorithms to help with the video recommendations.

Development Follow 141 Followers

Google BigQuery Now Allows to Query All Open-Source Projects on GitHub

by Sergio De Simone Follow 5 Followers on  Jul 08, 2016 2

A full snapshot of more than 2.8 million open source project hosted on GitHub is now available in Google’s BigQuery, Google and GitHub announced. This will make it possible to query almost 2 billion source files hosted on GitHub using SQL.

Architecture & Design Follow 604 Followers

Neha Narkhede: Large-Scale Stream Processing with Apache Kafka

by Ralph Winzinger Follow 0 Followers on  Jun 19, 2016

In her presentation "Large-Scale Stream Processing with Apache Kafka" at QCon New York 2016, Neha Narkhede introduces Kafka Streams, a new feature of Kafka for processing streaming data. According to Narkhede stream processing has become popular because unbounded datasets can be found in many places. It is no longer a niche problem like, for example, machine learning.

Data Science Follow 265 Followers

LinkedIn Details Production Kafka Debugging and Best Practices

by Dylan Raithel Follow 4 Followers on  Jun 16, 2016

LinkedIn’s Joel Koshy details their Kafka usage, debugging and monitoring two production incidents in using the core Kafka infrastructure concepts, semantics and behavioral patterns to plan for and detect similar problems in the future.

Data Science Follow 265 Followers

LinkedIn Details Open-Sourced Kafka Monitor

by Dylan Raithel Follow 4 Followers on  Jun 08, 2016

LinkedIn recently detailed open-sourced Kafka Monitor service that they're using to monitor production Kafka clusters as well as extensive testing automation, leading them to identify bugs in the main Kafka trunk and contribute solutions to the open-source community.

Data Science Follow 265 Followers

Confluent Platform 3.0 Supports Kafka Streams for Real-Time Data Processing

by Srini Penchikala Follow 13 Followers on  Jun 03, 2016 2

Confluent Platform 3.0 messaging system from Confluent, the company behind Apache Kafka messaging framework, supports Kafka Streams for real-time data processing. The company announced last week the general availability of the latest version of the open source Confluent platform.

Data Science Follow 265 Followers

Cloudera Announces Partnership with the Broad Institute

by Dylan Raithel Follow 4 Followers on  Jun 02, 2016

Cloudera announced their partnership with MIT & Harvard's Broad Institute and detailed some of their experience with the Genome Analytics Toolkit pipeline.

Data Science Follow 265 Followers

Apache Spark 2.0 Technical Preview

by Alex Giamas Follow 3 Followers on  May 31, 2016

Two years after the first release of Apache Spark, Databricks announced the technical preview of Apache Spark 2.0 , based on upstream branch 2.0.0-preview. The preview is not ready for production, neither in terms of stability nor API, but is a release intended to gather feedback from the community ahead of the general availability of the release.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT