Database Content on InfoQ
-
Streaming Log Analytics with Kafka
Kresten Thorup discusses how and why they use Kafka internally and demos how they utilize it as a straightforward event-sourcing model for distributed deployments.
-
Papers in Production Lightning Talks
Papers: Towards a Solution to the Red Wedding Problem, A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise, and A Machine Learning Approach to Databases Indexes.
-
Apache Metron in the Real World – Big Data and Cybersecurity, a Perfect Match
Dave Russell takes a look at a number of different organizations that are on their big data cybersecurity journey with Apache Metron.
-
Petastorm: A Light-Weight Approach to Building ML Pipelines
Yevgeni Litvin describes how Petastorm facilitates tighter integration between the Big Data and Deep Learning worlds, simplifies data management and data pipelines, and speeds up model experimentation.
-
Choosing Kubernetes: Managing Risk in Cloud Infrastructure
Ben Butler-Cole talks about Neo4j’s use of Kubernetes as a foundation for their stateful service: why they chose it and how they handled the risks associated with that choice.
-
Applying Deep Learning to Airbnb Search
Malay Haldar discusses the work done in applying neural networks at Airbnb to improve the search beyond the results obtained with ML.
-
People You May Know: Fast Recommendations over Massive Data
Sumit Rangwala and Felix GV present the evolution of PYMK’s architecture, focusing on Gaia, a real-time graph computing capability, and Venice, an online feature store with scoring capability.
-
Life of a Distributed Graph Database Query
Teon Banek describes the life of a query in Memgraph, following the process from reading the query as a character string through planning and distributed execution of query operations.
-
Productionizing H2O Models with Apache Spark
Jakub Hava demonstrates the creation of pipelines integrating H2O machine learning models and their deployments using Scala or Python.
-
YugaByte DB - A Planet-Scale Database for Low Latency Transactional Apps
Amey Banarse and Karthik Ranganathan introduce and demo YugaByte DB, a large-scale DB, highlighting distributed transactions with global consistency.
-
Winning Ways for Your Visualization Plays
Mark Grundland explores practical techniques for information visualization design to take better account of the fundamental limitations of visual perception.
-
Fast and Furious: Searching in a Distributed World with Highly Available Spring Data Redis
Julien Ruaux discusses the Redis Enterprise architecture and demos Redis clusters, builds three microservices, performs full-text searches, and views the results using Spring Boot Web and Angular.