InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Extreme Programming Meets Real-time Data
Tom Johnson and Gel Goldsby talk about scaling problems they encountered at Unruly, and where extreme programming values led them.
-
AI in Medicine
Anthony Chang presents the current status of AI in medicine and the foreseeable future in front of it.
-
Products and Prototypes with Keras
Micha Gorelick shows how to build a working product with Keras, a high-level deep learning framework, discussing design decisions, and demonstrating how to train and deploy a model.
-
Causal Consistency for Large Neo4j Clusters
Jim Webber explores the new Causal clustering architecture for Neo4j, how it allows users to read writes straightforwardly, explaining why this is difficult to achieve in distributed systems.
-
Deep Learning at Scale
Scott Le Grand describes his work at NVidia, Amazon and Teza, including the DSSTNE distributed deep learning framework.
-
Building Robust Machine Learning Systems
Stephen Whitworth talks about his experience at Ravelin, and provides useful practices and tips to help ensure our machine learning systems are robust, well audited, avoid embarrassing predictions.
-
Big Data Infrastructure @ LinkedIn
Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.
-
Using NLP, Machine Learning & Deep Learning Algorithms to Extract Meaning from Text
David Talby walks through building a natural language annotations pipeline with domain-specific annotators, and using deep learning to automatically expand and update taxonomies.
-
Scaling up Near Real-Time Analytics @Uber &LinkedIn
Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.
-
Real-Time Recommendations Using Spark Streaming
Elliot Chow discusses the data pipeline that they built with Kafka, Spark Streaming, and Cassandra to process Netflix user activities in real time for the Trending Now row.
-
Effective Data Pipelines: Data Mngmt from Chaos
Katharine Jarmul discusses implementation decisions for those looking for a practical recommendation on the "what" and "how" of data automation workflows.
-
The Move to AI: from HFT to Laplace Demon
Eric Horesnyi and Albert Bifet discuss how hedge funds have moved beyond High Frequency Trading using AI and real-time data processing.