InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Data Driven Action: A Primer on Data Science
S Aerni, S Ramanujam and J Vawdrey present approaches and open source tools for wrangling and modeling massive datasets, scaling Java applications for NLP on MPP through PL/Java and much more.
-
Boot Your Search with Spring
Christoph Strobl focuses on integrating search solutions like Solr, Elasticsearch as well as MongoDBs full text search into an application.
-
How Netflix Directs 1/3rd of Internet Traffic
Haley Tucker and Mohit Vora discuss the architecture at Netflix that makes streaming happen, while highlighting interesting lessons and design patterns that can be widely applied.
-
Stream Processing in Uber
Danny Yuan discusses how Uber uses stream processing to solve a wide range of problems, including real-time aggregation and prediction on geospatial time series, and much more.
-
Powering the Industrial Enterprise: Introducing the IOT Platform-as-a-Service
Jesus Rodriguez explores the characteristics of the IOT PaaS vs. predecessor PaaS architectures, focusing on device management, event driven integration, real-time analytics and offline communication.
-
Tuning Java for Big Data
Scott Seighman discusses causes of common performance issues in Big Data environments, heap size, garbage collection, JVM reuse tuning guidelines and Big Data performance analysis tools.
-
Data Driven Product Development
Lyndon Maher, Paul McManus discuss data driven development, how to collect data, getting feedback, tools to use, and how to integrate a data-driven mentality into the team.
-
Modeling Complex Game Economy with Neo4j
Yan Cui shares lessons learned using Neo4j to model the in-game economy of the "Here Be Monsters" game and automate the balancing process.
-
Ground-up Introduction to In-memory Data
Viktor Gamov covers In-Memory technology, distributed data topologies, making in-memory reliable, scalable and durable, when to use NoSQL, and techniques for Big In-Memory Data.
-
Pulsar: Real-time Analytics at Scale
Sharad Murthy & Tony Ng present Pulsar, a real-time streaming system which can scale to millions of events per second with high availability and 4GL language support.
-
Exploratory Data Analysis with R
Matthew Renze introduces the R programming language and demonstrates how R can be used for exploratory data analysis.
-
Spreadsheets for Developers
Felienne Hermans presents various algorithms that outlining the power of Excel, showing that spreadsheets are fit for TDD and rapid prototyping.