Getting the Data Needed for Data Science

by Ben Linders on  Sep 02, 2016

Data science is about the data that you need; deciding which data to collect, create, or keep is fundamental argues Lukas Vermeer, an experienced Data Science professional and Product Owner for Experimentation at True innovation starts with asking big questions, then it becomes apparent which data is needed to find the answers you seek.

Azure Logic Apps Reaches General Availability

by Kent Weare on  Aug 08, 2016

On July 27th, Microsoft announced their Integration Platform as a Service (iPaaS) offering, Logic Apps has reached General Availability (GA). The GA release includes additional management support, telemetry events, alerts, and consumption-based pricing. InfoQ reached out Jim Harrer, principal group program manager at Microsoft to gain further insight into this Logic Apps release.

Azure Premium Messaging Service Reaches General Availability

by Kent Weare on  Jul 31, 2016

On July 15th, Microsoft announced the Azure Premium Messaging service has reached General Availability (GA). Premium Messaging targets customers who would like more predictable messaging performance. InfoQ reached out to Dan Rosanova, Principal Program Manager on the Azure Service Bus team for additional insight into this milestone.

Basho Open Sources Time Series Database Riak TS 1.3

by Rags Srinivas on  Jul 15, 2016

InfoQ's Rags Srinivas talks to Basho's CTO Dave McCrory about the open sourcing of Riak TS 1.3 which is geared to handle time series data.

Meson Workflow Orchestration and Scheduling Framework for Netflix Recommendations

by Srini Penchikala on  Jul 10, 2016

Netflix's goal is to predict what you want to watch before you watch it. They do this by running a number of machine learning (ML) workflows every day. Meson is a workflow orchestration and scheduling framework that manages the lifecycle of all these machine learning pipelines that build, train and validate personalization algorithms to help with the video recommendations.

Google BigQuery Now Allows to Query All Open-Source Projects on GitHub

by Sergio De Simone on  Jul 08, 2016 2

A full snapshot of more than 2.8 million open source project hosted on GitHub is now available in Google’s BigQuery, Google and GitHub announced. This will make it possible to query almost 2 billion source files hosted on GitHub using SQL.

Docker Swarm Is Dead. Long Live Docker Swarm.

by Elton Stoneman on  Jun 28, 2016

At DockerCon, Docker released version 1.12 of the core product, Docker Engine. The biggest new feature is that Docker Swarm is no longer a separate tool - now it's built into Docker Engine, making it easier to combine multiple Docker hosts into a single logical unit for increased scale and reliability.

DockerCon 2016: A Summary of Announcements and Key Takeaways

by Daniel Bryant on  Jun 26, 2016

At DockerCon 2016, held in Seattle, the latest 1.12 beta version of Docker Engine was announced that includes the integration of Docker Swarm to provide container orchestration. Additional announcement included: the Docker for Mac and Windows has now been made public; a private beta for Docker for AWS and Azure has been opened; and the release of a 'DAB' file format for packaging artifacts.

Neha Narkhede: Large-Scale Stream Processing with Apache Kafka

by Ralph Winzinger on  Jun 19, 2016

In her presentation "Large-Scale Stream Processing with Apache Kafka" at QCon New York 2016, Neha Narkhede introduces Kafka Streams, a new feature of Kafka for processing streaming data. According to Narkhede stream processing has become popular because unbounded datasets can be found in many places. It is no longer a niche problem like, for example, machine learning.

LinkedIn Details Production Kafka Debugging and Best Practices

by Dylan Raithel on  Jun 16, 2016

LinkedIn’s Joel Koshy details their Kafka usage, debugging and monitoring two production incidents in using the core Kafka infrastructure concepts, semantics and behavioral patterns to plan for and detect similar problems in the future.

Kief Morris: Implementing Infrastructure as Code

by Ralph Winzinger on  Jun 14, 2016

Moving applications to the cloud has somewhat become commodity in the meantime - not only for big players, but also for smaller companies that rely on flexibility and resource utilization. In his presentation "Implementing Infrastructure as Code", Kief Morris, cloud practice lead at ThoughWorks, shares some key principles and recommendations on how to leverage cloud based infrastructure.

LinkedIn Details Open-Sourced Kafka Monitor

by Dylan Raithel on  Jun 08, 2016

LinkedIn recently detailed open-sourced Kafka Monitor service that they're using to monitor production Kafka clusters as well as extensive testing automation, leading them to identify bugs in the main Kafka trunk and contribute solutions to the open-source community.

Java 9 Will Remove CORBA from Default Classpath

by Abraham Marín Pérez on  Jun 07, 2016

As part of the ongoing transition to the module system, CORBA and other Java EE modules won't be included in the default classpath from Java 9 onwards. These modules will still be available, but specific command line flags will have to be used to be able to use them. The change will only affect non-modular applications targeting Java 9, for modular ones already need to indicate their dependencies.

Confluent Platform 3.0 Supports Kafka Streams for Real-Time Data Processing

by Srini Penchikala on  Jun 03, 2016 2

Confluent Platform 3.0 messaging system from Confluent, the company behind Apache Kafka messaging framework, supports Kafka Streams for real-time data processing. The company announced last week the general availability of the latest version of the open source Confluent platform.

Cloudera Announces Partnership with the Broad Institute

by Dylan Raithel on  Jun 02, 2016

Cloudera announced their partnership with MIT & Harvard's Broad Institute and detailed some of their experience with the Genome Analytics Toolkit pipeline.

General Feedback
Marketing and all content copyright © 2006-2016 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.
Privacy policy

We notice you're using an ad blocker

We understand why you use ad blockers. However to keep InfoQ free we need your support. InfoQ will not provide your data to third parties without individual opt-in consent. We only work with advertisers relevant to our readers. Please consider whitelisting us.