InfoQ Homepage Database Content on InfoQ
-
Implementing AutoML Techniques at Salesforce Scale
Matthew Tovbin shows how to build ML models using AutoML (Salesforce), including techniques for automatic data processing, feature generation, model selection, hyperparameter tuning and evaluation.
-
How to Use Data Responsibly
Emma Prest and Clare Kitching discuss practical, pragmatic and ethical data science, talking about real world experience from the work of DataKind UK.
-
Privacy Ethics – A Big Data Problem
Raghu Gollamudi broadly covers best practices with respect to Data Management aspects from mapping Enterprise data to applying Data Protection rules like GDPR at petabyte scale.
-
How Machines Help Humans Root Case Issues @ Netflix
Seth Katz discusses ways to build tools designed to enhance the cognitive ability of humans through automated analysis to speed root cause detection in distributed systems.
-
What is a Data Citizen?
Caitlin McDonald discusses how big data affects people online and the ethics to be considered when dealing with data.
-
When Data Kills
Cori Crider shares insights from her investigations of US drone strikes in Yemen and Pakistan, and explores how misuse of mass surveillance data has claimed innocent lives.
-
RDBMS and Apache Geode Data Movement: Low Latency ETL Pipeline by Using Cloud-Native Event Driven Microservices
Paul Warren, Heather Riddle discuss how to create cloud-native event driven microservices for RDBMS and Apache Geode by using Cloud Foundry, Spring Cloud Stream, and RabbitMQ/Kafka.
-
Software Design for Persistent Memory Systems
Howard Chu talks about both naive approaches to leveraging NVRAM, and reasons to avoid those approaches, as well as optimal, proven methods for building systems around persistent memory.
-
The Future of Distributed Databases Is Relational
Sumedh Pathak talks about his team’s journey to create a more modern relational database, distributed systems, scaling Postgres, distributed query planner and the distributed deadlock detection.
-
Streaming SQL to Unify Batch & Stream Processing w/ Apache Flink @Uber
Shuyi Chen and Fabian Hueske explore SQL’s role in the world of streaming data and its implementation in Apache Flink and covering streaming semantics, event time, and incremental results.
-
JDBC, What Is It Good For?
Thomas Risberg reviews what has been happening in the world of JDBC lately and how it fits with the latest trends like Reactive, Non-Blocking APIs, Microservices, CQRS and Event Sourcing.
-
Gimel: PayPal’s Analytics Data Platform
Deepak Chandramouli introduces and demos Gimel, a unified analytics data platform which provides access to any storage through a single unified data API and SQL.