InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Server-Less Design Patterns for the Enterprise with AWS Lambda
Tim Wagner defines server-less computing, examines the key trends and innovative ideas behind the technology, and looks at design patterns for big data, event processing, and mobile using AWS Lambda.
-
Vowpal Wabbit, A Machine Learning System
John Langford discusses how to use Vowpal Wabbit in and as a machine learning system including architecture, unique capabilities, and applications, applied to personalized news recommendation.
-
Large-Scale Stream Processing with Apache Kafka
Neha Narkhede explains how Apache Kafka was designed to support capturing and processing distributed data streams by building up the basic primitives needed for a stream processing system.
-
Online Data Mining and Machine Learning
Edo Liberty presents some basic concepts and an introduction to the subfields of machine learning and data mining.
-
Introducing Apache Ignite
Christos Erotocritou introduces Apache Ignite, discussing how it is used to solve some of the most demanding scalability and performance challenges. He covers typical use cases and examples.
-
Building a Predictive Intelligence Engine
Viral Bajaria explains a formula for reaching the B2B buyer early in the sales cycle by tying together billions of rows of customer data and overlaying predictive intelligence technology.
-
The Future of Data Science
The panelists discuss some of the trends in data science today, the job of a data scientist, the tools and other related issues.
-
APIs, Spreadsheets & Drinking Fountains: Using Open Data in Real Life
Shelby Switzer discusses success stories and failures of using the public data provided by governments, along with techniques for making such data usable.
-
Detecting Anomalies in Streaming Data, Evaluating Algorithms for Real-World Use
Alexander Lavin introduces the Numenta Anomaly Benchmark (NAB), a framework for evaluating anomaly detection algorithms on streaming data.
-
GoshawkDB: Making Time with Vector Clocks
Matthew Sackman discusses dependencies between transactions, how to capture these with Vector Clocks, how to treat Vector Clocks as a CRDT, and how GoshawkDB uses them for a distributed data store.
-
Predicting the Future: Surprising Revelations trom Truly Big Data
Pushpraj Shukla discusses how Microsoft Bing predicts the future based on aggregate human behavior using one of the largest scale data sets, and recent progress in large scale deep learnt models.
-
Staying in Sync: from Transactions to Streams
Martin Kleppmann explores using event streams and Kafka for keeping data in sync across heterogeneous systems, and compares this approach to distributed transactions.