BT
rss
  • AI, ML & Data Engineering Follow 763 Followers

    Apache Beam Interview with Frances Perry

    by Dylan Raithel Follow 9 Followers on  Jun 20, 2017

    InfoQ Interviews Apache Beam's Frances Perry about the impetus for using Beam and the future of the top-level open source project and covers the thoughts behind the programming model as well as some of the touch-points in integration with other data engineering tools like Apache Spark and Flink.

  • AI, ML & Data Engineering Follow 763 Followers

    Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX

    by Srini Penchikala Follow 34 Followers on  Mar 14, 2017 2

    In this article, author Srini Penchikala discusses Apache Spark GraphX library used for graph data processing and analytics. The article includes sample code for graph algorithms like PageRank, Connected Components and Triangle Counting.

  • AI, ML & Data Engineering Follow 763 Followers

    Traffic Data Monitoring Using IoT, Kafka and Spark Streaming

    by Amit Baghel Follow 0 Followers on  Sep 28, 2016 16

    Internet of Things (IoT) is an emerging disruptive technology and becoming an increasing topic of interest. One of the areas of IoT application is the connected vehicles. In this article we'll use Apache Spark and Kafka technologies to analyse and process IoT connected vehicle's data and send the processed data to real time traffic monitoring dashboard.

AI, ML & Data Engineering Follow 763 Followers

Big Data Processing with Apache Spark - Part 5: Spark ML Data Pipelines

Posted by Srini Penchikala Follow 34 Followers on  Sep 24, 2016

In this fifth installment of Apache Spark article series, author Srini Penchikala discusses Spark ML package and how to use it to create and manage machine learning data pipelines. 2

AI, ML & Data Engineering Follow 763 Followers

Spark GraphX in Action Book Review and Interview

Posted by Srini Penchikala Follow 34 Followers on  Sep 12, 2016

InfoQ spoke with authors of Spark GraphX in Action book, Apache Spark framework and what's coming up in the area of graph data processing and analytics.

AI, ML & Data Engineering Follow 763 Followers

Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines

Posted by Dylan Raithel Follow 9 Followers on  Aug 29, 2016

InfoQ interviews Chris Fregly, organizer for the 4000+ member Advanced Spark and TensorFlow Meetup about the PANCAKE STACK workshop, Spark and building data pipelines for a machine learning pipeline

AI, ML & Data Engineering Follow 763 Followers

Big Data Processing with Apache Spark - Part 4: Spark Machine Learning

Posted by Srini Penchikala Follow 34 Followers on  May 15, 2016

In this fourth installment of Apache Spark article series, author Srini Penchikala discusses machine learning concept & Spark MLlib library for running predictive analytics using a sample application.

AI, ML & Data Engineering Follow 763 Followers

Big Data Processing with Apache Spark - Part 3: Spark Streaming

Posted by Srini Penchikala Follow 34 Followers on  Jan 07, 2016

In this article, third installment of Apache Spark series, author discusses Apache Spark Streaming framework for processing real-time streaming data using a log analytics sample application. 7

AI, ML & Data Engineering Follow 763 Followers

Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library

Posted by Konur Unyelioglu Follow 0 Followers on  Dec 22, 2015

In this article, author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using the Apache Spark's MLlib Java API.

AI, ML & Data Engineering Follow 763 Followers

Big Data Processing with Apache Spark - Part 2: Spark SQL

Posted by Srini Penchikala Follow 34 Followers on  Apr 16, 2015

Spark SQL, part of Apache Spark, is used for structured data processing by running SQL queries on Spark data. Srini Penchikala discusses Spark SQL module & how it simplifies data analytics using SQL. 1

AI, ML & Data Engineering Follow 763 Followers

Big Data Processing with Apache Spark – Part 1: Introduction

Posted by Srini Penchikala Follow 34 Followers on  Jan 30, 2015

Apache Spark is an open source big data framework built around speed, ease of use, and sophisticated analytics. In this article, Srini Penchikala discusses how Spark helps with big data processing. 8

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT