InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Corda: Looking Forward and Back, Blockchain on a JVM Stack
Carolyne Quinn and Mike Ward discuss Corda, an open source enterprise blockchain, and look at technologies used: Kotlin, JVMs, pluggable consensus, Corda Foundation and SGX,.
-
Practical NLP for the Real World
Emmanuel Ameisen discusses examples of how to build practical applications using NLP, diving into data visualization and labelling, as well as model validation.
-
Federated Learning: Rewards & Challenges of Distributed Private ML
Eric Tramel discusses the basic concepts underlying the federated ML approach, the advantages it brings, as well as the challenges associated with constructing federated solutions.
-
Michelangelo Palette: A Feature Engineering Platform at Uber
Amit Nene and Eric Chen discuss the infrastructure built by Uber for Michelangelo ML Platform that enables a general approach to Feature Engineering across diverse data systems.
-
Instrumentation, Observability & Monitoring of Machine Learning Models
Josh Wills discusses the monitoring and visibility needs of machine learning models in order to bridge gaps between ML practitioners and DevOps.
-
Life of a Distributed Graph Database Query
Teon Banek describes the life of a query in Memgraph following the process from reading a query as a character string, through planning and distributed execution of query operations.
-
Privacy: The Last Stand for Fair Algorithms
Katharine Jarmul discusses research related to fair-and-private ML algorithms and privacy-preserving models, showing that caring about privacy can help ensure a better model overall and support ethics
-
The Future of Transportation
Anita Sengupta discusses the future of transportation with an eye towards how machine learning and AI will help shape the future.
-
Massive Scale Anomaly Detection Framework
Guy Gerson introduces an anomaly detection framework PayPal uses, focusing on flexibility to support different types of statistical and ML models, and inspired by scikit-learn and Spark MLlib.
-
Modern NLP for Pre-Modern Practitioners
Joel Grus discusses the latest in NLP research breakthrough, and how to incorporate NLP concepts and models into a project.
-
Docker Data Science Pipeline
Lennard Cornelis explains why they chose OpenShift and Docker to connect to the Hadoop environment, also how to set up a Docker container running a data science model using Hive, Python, and Spark.
-
H2O's Driverless AI: An AI That Creates AI
Marios Michailidis shares their approach on automating machine learning using H2O’s Driverless AI.