InfoQ Homepage Database Content on InfoQ

Articles

RSS Feed

Newer Older

Architecture & Design

A Critique of Resizable Hash Tables: Riak Core & Random Slicing

This fall, Wallaroo Labs will be releasing a large new feature set to our distributed data stream processing framework, Wallaroo. One of the new features requires a size-adjustable, distributed data structure to support growing & shrinking of compute clusters. It might be a good idea to use a distributed hash table to support the new feature, but what distributed hash algorithm should we choose?

Scott Lystig Fritchie
on Aug 26, 2018
AI, ML & Data Engineering

How to Choose a Stream Processor for Your App

Choosing a stream processor for your app can be challenging with many options to choose from. The best choice depends on individual use cases. In this article, the authors discuss a stream processor reference architecture, key features required by most streaming applications and optional features that can be selected based on specific use cases.

Miyuru Dayarathna Srinath Perera
on Aug 21, 2018
AI, ML & Data Engineering

Analyzing and Preventing Unconscious Bias in Machine Learning

This article is based on Rachel Thomas’s keynote presentation, “Analyzing & Preventing Unconscious Bias in Machine Learning” at QCon.ai 2018. Thomas talks about the pitfalls and risk the bias in machine learning brings to the decision-making process. She discusses three use cases of machine learning bias.

Srini Penchikala
on Aug 14, 2018
Culture & Methods

Q&A on the Book Testing in the Digital Age

The Book Testing in the Digital Age by Tom van de Ven, Rik Marselis, and Humayun Shaukat, explains the impact that developments like robotics, artificial intelligence, internet of things, and big data are having in testing. It explores the challenges and possibilities that the digital age brings us when it comes to testing software systems.

Tom van de Ven Ben Linders
on Jul 19, 2018
AI, ML & Data Engineering

Democratizing Stream Processing with Apache Kafka and KSQL - Part 1

In this article, author Michael Noll discusses the stream processing with KSQL, the streaming SQL engine for Apache Kafka. Topics covered include challenges of stateful stream processing and how KSQL addresses them, and how KSQL helps to bridge the world of streams and databases through streams and tables.

Michael Noll
on Jun 15, 2018
Development

Picking an Active-Active Geo Distribution Strategy: Comparing Merge Replication and CRDT

Modern distributed applications are fuelling the growing demand for distributed active-active, multi-master databases. While most popular databases support multi-master deployment, different databases employ different techniques. LWW, MVCC, merge replication and CRDTs deliver eventual consistency, offering read and write access with local latency and remaining available during network partitions.

Roshan Kumar
on Jun 12, 2018
AI, ML & Data Engineering

Columnar Databases and Vectorization

In this article, author Siddharth Teotia discusses the Dremio database which is based on Apache Arrow with vectorization capabilities.

Siddharth Teotia
on May 27, 2018
Culture & Methods

Q&A on the Book Software Wasteland

Almost all Enterprise Information Systems now cost vastly more to implement than they should. When you have hundreds or thousands of complex applications, you are stuck in the Application Centric Quagmire. In the book Software Wasteland Dave McComb explores what is causing application development waste and how visualizing the cost of change and becoming data-centric can help to reduce the waste.

Ben Linders
on May 07, 2018
DevOps

Monitoring SRE's Golden Signals

Golden signals are increasingly popular these days due to the rise of SRE. This article outlines what golden signals are, and how to monitor and use them in the context of various common services.

Steve Mushero
on Apr 27, 2018
Architecture & Design

Polyglot Persistence Powering Microservices

At Netflix, the cloud database engineering team is responsible for providing several flavors of data persistence as a service to microservice development teams. Roopa Tangirala explained how her team has created self-service tools that help developers easily implement the appropriate data store for each project's needs.

Thomas Betts Roopa Tangirala
on Apr 10, 2018
DevOps

GDPR for Operations

With GDPR, taking care of personal data is an organisation-wide responsibility, but in the operations we can provide a lot of supporting tools to help deal with the multiple facets of this problem.

Jon Topper
on Mar 10, 2018
DevOps

Why and How Database Changes Should Be Included in the Deployment Pipeline

Eduardo Piairo on why databases and applications should coexist in the same deployment pipeline and different scenarios and steps to achieve it.

Eduardo Piairo
on Jan 30, 2018

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles