Learning Paths: QCon London Expert Recommendations

Posted by Wesley Reisz on  Feb 16, 2017

Advice on the best talks to attend at QCon London 2017 from London Thought Leaders.

Q&A with Immuta on the Implications of EU’s General Data Protection Regulation (GDPR)

Posted by Manuel Pais on  Feb 10, 2017

InfoQ talked with Immuta’s Andrew Burt and Steve Touw to better understand the implications and challenges of the EU's General Data Protection Regulation, which will come into effect in May 2018.

Cassandra: The Definitive Guide, 2nd Edition Book Review and Interview

Posted by Srini Penchikala on  Jan 05, 2017

Cassandra: The Definitive Guide, 2nd Edition, by Jeff Carpenter and Eben Hewitt, covers version 3.0 of the Cassandra NoSQL database. InfoQ spoke with co-author Jeff Carpenter.

Article Series: Getting a Handle on Data Science

Posted by Francine Bennett on  Dec 05, 2016

In this series we explore ways of making sense of data science - understanding where it’s needed and where it’s not, and how to make it an asset for you, from people who’ve been there and done it.

Peter Cnudde on How Yahoo Uses Hadoop, Deep Learning and Big Data Platform

Posted by Srini Penchikala on  Oct 13, 2016

Yahoo uses Hadoop for a range of big data and machine learning use cases. InfoQ spoke with Peter Cnudde about how Yahoo leverages big data technologies.

Traffic Data Monitoring Using IoT, Kafka and Spark Streaming

Posted by Amit Baghel on  Sep 28, 2016

The Internet of Things (IoT) is an emerging technology, and connected vehicles are one of its application areas. In this article, we'll use Spark and Kafka to analyse and process data from IoT connected vehicles.

Big Data Processing with Apache Spark - Part 5: Spark ML Data Pipelines

Posted by Srini Penchikala on  Sep 24, 2016

In this fifth installment of the Apache Spark article series, author Srini Penchikala discusses the Spark ML package and how to use it to create and manage machine learning data pipelines.

Spark GraphX in Action Book Review and Interview

Posted by Srini Penchikala on  Sep 12, 2016

InfoQ spoke with the authors of the book Spark GraphX in Action about the Apache Spark framework and what's coming up in the area of graph data processing and analytics.

Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines

Posted by Dylan Raithel on  Aug 29, 2016

InfoQ interviews Chris Fregly, organizer of the 4000+ member Advanced Spark and TensorFlow Meetup, about the PANCAKE STACK workshop, Spark, and building data pipelines for machine learning.