InfoQ Homepage Database Content on InfoQ

Articles

RSS Feed

Newer Older

Architecture & Design

Challenges of Building a Reliable Realtime Chat Service

Realtime chat has become a common feature of modern applications. These days not only communicators and social networks allow users to talk with each other over the Internet—chat is crucial in healthcare, e-commerce, gaming and many other industries.

Pawel Ledwon
on Nov 09, 2018
AI, ML & Data Engineering

Seth James Nielson on Blockchain Technology for Data Governance

Seth James Nielson recently hosted a tutorial workshop at Data Architecture Summit 2018 Conference about Blockchain technology and its impact on data architecture and data governance.

Srini Penchikala Seth James Nielson
on Nov 07, 2018
AI, ML & Data Engineering

Apache Kafka: Ten Best Practices to Optimize Your Deployment

Author Ben Bromhead discusses the latest Kafka best practices for developers to manage the data streaming platform more effectively. Best practices include log configuration, proper hardware usage, Zookeeper configuration, replication factor, and partition count.

Ben Bromhead
on Oct 19, 2018
AI, ML & Data Engineering

Natural Language Processing with Java - Second Edition: Book Review and Interview

Natural Language Processing with Java - Second Edition book covers the Natural Language Processing (NLP) topic and various tools developers can use in their applications. Technologies discussed in the book include Apache OpenNLP and Stanford NLP. InfoQ spoke with co-author Richard Reese about the book and how NLP can be used in enterprise applications.

Srini Penchikala
on Oct 10, 2018
Development

14 Things I Wish I’d Known When Starting with MongoDB

I’ve been a database person for an embarrassing length of time, but I only started working with MongoDB recently. When I was starting out with MongoDB, there are a few things that I wish I’d known about. With general experience, there will always be preconceptions of what databases are and what they do. In hopes of making it easier for other people, here is a list of common mistakes.

Phil Factor
on Sep 13, 2018
AI, ML & Data Engineering

Democratizing Stream Processing with Apache Kafka® and KSQL - Part 2

In this article, author Robin Moffatt shows how to use Apache Kafka and KSQL to build data integration and processing applications with the help of an e-commerce sample application. Three use cases discussed: customer operations, operational dashboard, and ad-hoc analytics.

Robin Moffatt
on Sep 07, 2018
Architecture & Design

A Critique of Resizable Hash Tables: Riak Core & Random Slicing

This fall, Wallaroo Labs will be releasing a large new feature set to our distributed data stream processing framework, Wallaroo. One of the new features requires a size-adjustable, distributed data structure to support growing & shrinking of compute clusters. It might be a good idea to use a distributed hash table to support the new feature, but what distributed hash algorithm should we choose?

Scott Lystig Fritchie
on Aug 26, 2018
AI, ML & Data Engineering

How to Choose a Stream Processor for Your App

Choosing a stream processor for your app can be challenging with many options to choose from. The best choice depends on individual use cases. In this article, the authors discuss a stream processor reference architecture, key features required by most streaming applications and optional features that can be selected based on specific use cases.

Miyuru Dayarathna Srinath Perera
on Aug 21, 2018
AI, ML & Data Engineering

Analyzing and Preventing Unconscious Bias in Machine Learning

This article is based on Rachel Thomas’s keynote presentation, “Analyzing & Preventing Unconscious Bias in Machine Learning” at QCon.ai 2018. Thomas talks about the pitfalls and risk the bias in machine learning brings to the decision-making process. She discusses three use cases of machine learning bias.

Srini Penchikala
on Aug 14, 2018
Culture & Methods

Q&A on the Book Testing in the Digital Age

The Book Testing in the Digital Age by Tom van de Ven, Rik Marselis, and Humayun Shaukat, explains the impact that developments like robotics, artificial intelligence, internet of things, and big data are having in testing. It explores the challenges and possibilities that the digital age brings us when it comes to testing software systems.

Tom van de Ven Ben Linders
on Jul 19, 2018
AI, ML & Data Engineering

Democratizing Stream Processing with Apache Kafka and KSQL - Part 1

In this article, author Michael Noll discusses the stream processing with KSQL, the streaming SQL engine for Apache Kafka. Topics covered include challenges of stateful stream processing and how KSQL addresses them, and how KSQL helps to bridge the world of streams and databases through streams and tables.

Michael Noll
on Jun 15, 2018
Development

Picking an Active-Active Geo Distribution Strategy: Comparing Merge Replication and CRDT

Modern distributed applications are fuelling the growing demand for distributed active-active, multi-master databases. While most popular databases support multi-master deployment, different databases employ different techniques. LWW, MVCC, merge replication and CRDTs deliver eventual consistency, offering read and write access with local latency and remaining available during network partitions.

Roshan Kumar
on Jun 12, 2018

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles