InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Pyh3: Scalable and High Performance Graph Visualization in 3D Hyperbolic Space
Songxiao Zhang introduces Pyh3, a graph visualization library showing tree nodes in a 3D hyperbolic space.
-
Using NLP to Categorize and Find Similar Web Pages
Thomas Levi shows how to categorize web pages by building a system that exploits techniques in natural language processing and topic modeling.
-
Data Visualization with R
Matthew Renze introduces the R programming language as well as demonstrates how R can be used to create data visualizations to complete day-to-day developer tasks.
-
Integrating Hybrid Cloud Database-as-a-Service with Cloud Foundry’s Service Broker
Lenley Hensarling describes how EnterpriseDB Cloud Management can provide responsible DevOps models for the enterprise.
-
Achieving Mega-Scale Business Intelligence through Speed of Thought Analytics on Hadoop
Ian Fyfe discusses the different options for implementing speed-of-thought business analytics and machine learning tools directly on top of Hadoop.
-
Hydrator: Open Source, Code-Free Data Pipelines
Jonathan Gray introduces Hydrator, an open source framework and user interface for creating data lakes for building and managing data pipelines on Spark, MapReduce, Spark Streaming and Tigon.
-
Developing a Machine Learning Based Predictive Analytics Engine for Big Data Analytics
Ali Jalali presents how to develop a machine learning predictive analytics engine for big data analytics.
-
MongoDB-as-a-Service on Pivotal Cloud Foundry
Mallika Iyer and Sam Weaver cover a brief overview of Pivotal Cloud Foundry and deep dive into running MongoDB as a managed service on this platform.
-
Building an AI in the Cloud
Simon Chan shares the on-going challenges, the design dilemma and the steps to be taken when building customized large-scale predictive ML applications on a ML SaaS platform.
-
Wall St. Derivative Risk Solutions Using Geode
Andre Langevin and Mike Stolz discuss how Geode forms the core of many Wall Street derivative risk solutions which provide cross-product risk management at speeds suitable for automated hedging.
-
The Joy of Analysis Development
Hilary Parker discusses the history of the analysis development tools, the current state of the art, and the importance for data scientists and analysts to understand programming principles.
-
Machine Learning Fast and Slow
Suman Deb Roy talks about some of Betaworks’ internal data tools and platform, product-specific solutions and best practices they learned when machine learning has to drive the startup road.