Exploring Wikipedia with Apache Spark: A Live Coding Demo

Posted by Sameer Farooqui  on  Aug 23, 2016

Sameer Farooqui demos connecting to the live stream of Wikipedia edits, building a dashboard showing what’s happening with Wikipedia datasets and how people are using them in real time.


Real-time Stream Computing & Analytics @Uber

Posted by Sudhir Tonse  on  Apr 09, 2016

Sudhir Tonse discusses using stream processing at Uber: indexing and querying of geospatial data, aggregation and computing of streaming data, extracting patterns, TimeSeries analyses and predictions.


Developing Real-time Data Pipelines with Apache Kafka

Posted by Joe Stein  on  Mar 04, 2016

Joe Stein introduces developers to why and how to use Apache Kafka, a publish-subscribe messaging system rethought as a distributed commit log.


Pulsar: Real-time Analytics at Scale

Posted by Sharad Murthy, Tony Ng  on  Sep 13, 2015

Sharad Murthy & Tony Ng present Pulsar, a real-time streaming system which can scale to millions of events per second with high availability and 4GL language support.


Machine Learning and IoT

Posted by Ajit Jaokar  on  Aug 29, 2015

Ajit Jaokar discusses data science and IoT: sensor data, real-time processing, cognitive computing, integration of IoT analytics with hardware, IoT’s impact on healthcare, automotive, wearables, etc.


How 30 Years of Ticket Transaction Data Helps you Discover New Shows!

Posted by Vaclav Petricek  on  Aug 19, 2015

Vaclav Petricek discusses how to train models, architect and build a scalable system powered by Storm, Hadoop, Spark, Spring Boot and Vowpal Wabbit that meets SLAs measured in tens of milliseconds.


Java 8 in Anger

Posted by Trisha Gee  on  Aug 09, 2015

Trisha Gee uses Java 8 streams and lambdas to build an app that consumes a real-time feed of high-velocity data, uses services to make sense of the data, and presents it in a JavaFX dashboard.


IoT Realized - The Connected Car

Posted by Derek Beauregard, Phil Berman, Michael Minella, Darrel Sharpe  on  Mar 14, 2015

This session explores the power of Spring XD in the context of the Internet of Things (IoT).


Better Together - Using Spark and Redshift to Combine Your Data with Public Datasets

Posted by Eugene Mandel  on  Mar 12, 2015

Eugene Mandel discusses challenges of conforming data sources and compares processing stacks: Hadoop+Redshift vs Spark, showing how the technology drives the way the problem is modeled.


Applications of Enterprise Integration Patterns to Near Real-Time Radar Data Processing

Posted by Garrett Wampole  on  Dec 28, 2014

Garrett Wampole describes an experimental methodology of applying Enterprise Integration Patterns to the near real-time processing of surveillance radar data, developed by MITRE.


Design Patterns for Large-Scale Real-Time Learning

Posted by Sean Owen  on  Apr 15, 2014

Sean Owen provides examples of operational analytics projects, presenting a reference architecture and algorithm design choices for a successful implementation based on his experience with Oryx/Cloudera.


From The Lab To The Factory: Building A Production Machine Learning Infrastructure

Posted by Josh Wills  on  Jan 16, 2014

Josh Wills discusses using Hadoop technologies to build real-time data analysis models with a focus on strategies for data integration, large-scale machine learning, and experimentation.

Marketing and all content copyright © 2006-2016 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.