New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

Cassandra: The Definitive Guide, 2nd Edition Book Review and Interview

Posted by Srini Penchikala on  Jan 05, 2017

Cassandra: The Definitive Guide, 2nd Edition book authored by Jeff Carpenter and Eben Hewitt covers the Cassandra NoSQL database version 3.0. InfoQ spoke with the co-author Jeff Carpenter.

Article Series: Getting a Handle on Data Science

Posted by Francine Bennett on  Dec 05, 2016

In this series we explore ways of making sense of data science - understanding where it’s needed and where it’s not, and how to make it an asset for you, from people who’ve been there and done it.

Peter Cnudde on How Yahoo Uses Hadoop, Deep Learning and Big Data Platform

Posted by Srini Penchikala on  Oct 13, 2016

Yahoo uses Hadoop for different use cases in big data & machine learning areas. InfoQ spoke with Peter Cnudde on how Yahoo leverages big data technologies.

Traffic Data Monitoring Using IoT, Kafka and Spark Streaming

Posted by Amit Baghel on  Sep 28, 2016

Internet of Things (IoT) is an emerging technology. One of the areas of IoT is the connected vehicles. In this article, we'll use Spark and Kafka to analyse and process IoT connected vehicle's data. 9

Big Data Processing with Apache Spark - Part 5: Spark ML Data Pipelines

Posted by Srini Penchikala on  Sep 24, 2016

In this fifth installment of Apache Spark article series, author Srini Penchikala discusses Spark ML package and how to use it to create and manage machine learning data pipelines. 2

Spark GraphX in Action Book Review and Interview

Posted by Srini Penchikala on  Sep 12, 2016

InfoQ spoke with authors of Spark GraphX in Action book, Apache Spark framework and what's coming up in the area of graph data processing and analytics.

Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines

Posted by Dylan Raithel on  Aug 29, 2016

InfoQ interviews Chris Fregly, organizer for the 4000+ member Advanced Spark and TensorFlow Meetup about the PANCAKE STACK workshop, Spark and building data pipelines for a machine learning pipeline

Christine Doig on Data Science as a Team Discipline

Posted by Srini Penchikala on  Aug 26, 2016

Christine Doig spoke at OSCON Conference about data science as a team discipline and how to navigate data science Python ecosystem. InfoQ spoke with Christine about challenges of data science teams.

Big Data Analytics with Spark Book Review and Interview

Posted by Srini Penchikala on  Jun 23, 2016

Big Data Analytics with Spark, authored by Mohammed Guller, provides a practical guide for learning Apache Spark. InfoQ and the author discuss the book & development tools for big data applications.