AI, ML & Data Engineering Content on InfoQ
-
Big Data in Capital Markets
The authors present design patterns and use cases of capital market firms that are incorporating big data technologies into their credit risk analysis, price discovery or sentiment analysis software.
-
A/B Testing + Continuous Delivery = Everyday Product Launches
Nellwyn Thomas discusses how Etsy is using A/B testing, how Etsy's data-driven culture has evolved over time and how continuous delivery and big data can coexist.
-
How Facebook Scales Big Data Systems
Jeff Johnson introduces Apollo, a hierarchical NoSQL data system meant to deal with Facebook's distributed storage needs.
-
Migrating to Cloud Native with Microservices
Adrian Cockcroft discusses strategies, patterns and pathways to perform a gradual migration towards modern enterprise applications based on cloud, microservices and denormalized NoSQL databases.
-
Analyzing Big Data On The Fly
Shawn Gandhi gives an overview of real-time processing use cases and of how developers are using AWS Kinesis to shift from a traditional batch-oriented approach to a continual real-time data processing model.
-
How WebMD Maintains Operational Flexibility with NoSQL
Rajeev Borborah and Matthew Wilson discuss using NoSQL at WebMD (architecture, benefits, roadmap), with details on the caching and key-value storage implementation behind a few of the WebMD applications.
-
The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE
Randy Shoup describes KIXEYE's analytics infrastructure from Kafka queues through Hadoop 2 to Hive and Redshift, built for flexibility, experimentation, iteration, testability, and reliability.
-
Doing Data Science with F#
Tomas Petricek introduces F#’s capabilities in dealing with scientific data: type providers (CSV, XML, JSON, REST), interactive development, data visualization libraries, and integration with R or MATLAB.
-
The Next Wave of SQL-on-Hadoop: The Hadoop Data Warehouse
Marcel Kornacker presents a case study of an EDW built on Impala running on 45 nodes, reducing processing time from hours to seconds and consolidating multiple data sets into a single view.
-
Finding the Needle in a Big Data Haystack
In this solutions track talk, sponsored by Cloudera, Eva Andreasson discusses how search and Hadoop can help with some of the industry's biggest challenges. She introduces the data hub concept.
-
A Big Data Arsenal for the 21st Century
In this solutions track talk, sponsored by MongoDB, Matt Asay discusses the differences between some of the NoSQL and SQL databases and when it makes sense to use Hadoop with a NoSQL solution.
-
Data Movement at Very Large Scale
In this solutions track talk, sponsored by Solace Systems, Aaron Lee discusses the challenges of moving information and techniques that can increase the efficiency of data flows within big data architectures.