InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

Large-Scale Stream Processing with Apache Kafka

Neha Narkhede explains how Apache Kafka was designed to support capturing and processing distributed data streams by building up the basic primitives needed for a stream processing system.

Neha Narkhede
on Jul 03, 2016

Icon

50:46
AI, ML & Data Engineering

Online Data Mining and Machine Learning

Edo Liberty presents some basic concepts and an introduction to the subfields of machine learning and data mining.

Edo Liberty
on Jul 01, 2016

Icon

34:20
AI, ML & Data Engineering

Introducing Apache Ignite

Christos Erotocritou introduces Apache Ignite, discussing how it is used to solve some of the most demanding scalability and performance challenges. He covers typical use cases and examples.

Christos Erotocritou
on Jun 24, 2016

Icon

45:50
AI, ML & Data Engineering

Building a Predictive Intelligence Engine

Viral Bajaria explains a formula for reaching the B2B buyer early in the sales cycle by tying together billions of rows of customer data and overlaying predictive intelligence technology.

Viral Bajaria
on Jun 24, 2016

Icon

40:09
AI, ML & Data Engineering

The Future of Data Science

The panelists discuss some of the trends in data science today, the job of a data scientist, the tools and other related issues.

Peter Bakas Pushpraj Shukla Alexander Lavin Leo Li Mike Tamir Milind Bhandarkar
on Jun 19, 2016

Icon

50:05
Development

APIs, Spreadsheets & Drinking Fountains: Using Open Data in Real Life

Shelby Switzer discusses success stories and failures of using the public data provided by governments, along with techniques for making such data usable.

Shelby Switzer
on Jun 11, 2016

Icon

23:44
AI, ML & Data Engineering

Detecting Anomalies in Streaming Data, Evaluating Algorithms for Real-World Use

Alexander Lavin introduces the Numenta Anomaly Benchmark (NAB), a framework for evaluating anomaly detection algorithms on streaming data.

Alexander Lavin
on Jun 01, 2016

Icon

32:32
AI, ML & Data Engineering

GoshawkDB: Making Time with Vector Clocks

Matthew Sackman discusses dependencies between transactions, how to capture these with Vector Clocks, how to treat Vector Clocks as a CRDT, and how GoshawkDB uses them for a distributed data store.

Matthew Sackman
on May 29, 2016

Icon

50:32
AI, ML & Data Engineering

Predicting the Future: Surprising Revelations trom Truly Big Data

Pushpraj Shukla discusses how Microsoft Bing predicts the future based on aggregate human behavior using one of the largest scale data sets, and recent progress in large scale deep learnt models.

Pushpraj Shukla
on May 24, 2016

Icon

37:06
AI, ML & Data Engineering

Staying in Sync: from Transactions to Streams

Martin Kleppmann explores using event streams and Kafka for keeping data in sync across heterogeneous systems, and compares this approach to distributed transactions.

Martin Kleppmann
on May 20, 2016

Icon

48:25
AI, ML & Data Engineering

Netflix Keystone - How We Built a 700B/day Stream Processing Cloud Platform in a Year

Peter Bakas presents in detail how Netflix has used Kafka, Samza, Docker, and Linux to implement a multi-tenant pipeline processing 700B events/day in the Amazon AWS cloud.

Peter Bakas
on May 19, 2016

Icon

40:32
AI, ML & Data Engineering

Hunting Criminals with Hybrid Analytics

David Talby demos using Python libraries to build a ML model for fraud detection, scaling it up to billions of events using Spark, and what it took to make the system perform and ready for production.

David Talby
on May 10, 2016

Icon

41:26

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations