InfoQ Homepage Data Content on InfoQ
-
Data Decisions with Real-Time Stream Processing
Serhat Yilmaz talks about how Facebook is using stream processing at scale, the difficulties they have encountered and the solutions they have created to date.
-
Managing Data in Microservices
Randy Shoup discusses managing data in microservices and shares proven patterns and practical advice that has been successful at Google, eBay, and Stitch Fix.
-
Designing Visualizations for Action
Chris Varosy discusses strategies for designing data visualizations and dashboards that bring the insight users need to make decisions.
-
Data Consistency in Microservice Using Sagas
Chris Richardson discusses messaging, durability, and reliability in microservice architectures leveraging the Saga Pattern, explaining how sagas work and introduces a saga framework for Java.
-
Systems That Learn
Stephen Buckley discusses the Systems That Learn initiative which aims to create systems that learn by combining expertise in Systems and Machine Learning.
-
Homoiconicity: It Is What It Is
Stuart Sierra demonstrates the power that comes from having the same data representation at all layers: programming language, specification, database, inter-process communication, and user interface.
-
Data-Driven Coaching - Safely Turning Team Data into Coaching Insights
Troy Magennis shows how to expose data to teams in order for them to retrospect productively, determine if a process experiment is panning out as expected, and to explore process change opportunities.
-
Machine Learning in Academia and Industry
Deborah Hanus discusses some of the challenges that can arise when working with data.
-
AI-Based Data Extraction
George Roth presents the challenges of data extraction from unstructured content in the context of preparing the data for Data Analytics.
-
Data Preparation for Data Science: A Field Guide
Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.
-
Straggler Free Data Processing in Cloud Dataflow
Eugene Kirpichov describes the theory and practice behind Cloud Dataflow's approach to straggler elimination, and the associated non-obvious challenges, benefits, and implications of the technique.
-
Scaling up Near Real-Time Analytics @Uber &LinkedIn
Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.