InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

Articles

RSS Feed

Newer Older

Aerospike NoSQL Database Architecture

Aerospike is an open source distributed Key-Value NoSQL database. It supports flexible data schemas and ACID transactions. InfoQ spoke with Brian Bulkowski, Aerospike co-founder and CTO, about the NoSQL database architecture, advantages and its limitations.

Srini Penchikala
on Sep 25, 2014
AI, ML & Data Engineering

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), when stream processing makes sense, and what technologies and products you can choose from.

Kai Wähner
on Sep 10, 2014
Nikita Ivanov on GridGain’s In-Memory Accelerator for Hadoop

GridGain recently announced the In-Memory Accelerator for Hadoop, offering the benefits of in-memory computing to Hadoop based applications. It includes two components: an in-memory file system and a MapReduce implementation. InfoQ spoke with Nikita Ivanov, CTO of GridGain about the architecture of the product.

Srini Penchikala
on Sep 08, 2014
Apache CouchDB: The Definitive Introduction

Apache CouchDB is an open source document NoSQL database that uses JSON for storing documents. In this article, Jan Lehnardt gives an overview of CouchDB, its architecture and what problems it aims to solve and why it is different from all other databases.

Jan Lehnardt
on Aug 28, 2014
Practical Cassandra: A Developer's Approach - Book Review and Interview

Practical Cassandra: A Developer's Approach book by Russell Bradberry and Eric Lubow, is a developer's guide to build applications using Cassandra NoSQL database. InfoQ spoke with the authors about the book, Cassandra data model, design considerations and how Cassandra performs concurrency and versioning of the data sets.

Srini Penchikala
on Aug 21, 2014
Java

Introducing Spring XD, a Runtime Environment for Big Data Applications

Spring XD (eXtreme Data) is Pivotal’s Big Data play. It joins Spring Boot and Grails as part of the execution portion of the Spring IO platform. Whilst Spring XD makes use of a number of existing Spring projects it is a runtime environment rather than a library or framework, comprising a bin directory with servers that you start up and interact with via a shell.

Charles Humble
on Jul 23, 2014
Cindy Walker on Data Management Best Practices and Data Analytics Center of Excellence

Cindy Walker spoke at Enterprise Data World Conference about using semantic approaches to augment the data management practices. InfoQ spoke with her about the data management best practices and the data analytics center of excellence initiative.

Srini Penchikala
on Jul 13, 2014
Data Modeling with Key Value NoSQL Data Stores – Interview with Casey Rosenthal

In Key Value data stores, data is represented as a collection of key–value pairs. The key–value model is one of the simplest non-trivial data models, and richer data models are implemented on top of it. InfoQ spoke with Casey Rosenthal from Basho team about the data modeling concepts and best practices when using these NoSQL databases for data management.

Srini Penchikala
on Jun 25, 2014
Rich Reimer on SQL-on-Hadoop Databases and Splice Machine

SQL-on-Hadoop technologies include a SQL layer or a SQL database over Hadoop. These solutions are becoming popular recently as they solve the data management issues of Hadoop and provide a scale-out alternative for traditional RDBMSs. InfoQ spoke with Rich Reimer, VP of Marketing and Product Management at Splice Machine about the architecture and data patterns for SQL in Hadoop databases.

Srini Penchikala
on Jun 19, 2014
Transactional NoSQL Database

Document-oriented NoSQL databases are eliminating the impedance mismatch between developers and traditional data models. However developers have come to believe they need to sacrifice ACID transactions. In this article we will look at how MarkLogic dispels this myth

Ken Krupa
on Jun 12, 2014
Architecture & Design

Apache Kafka: Next Generation Distributed Messaging System

Apache Kafka is a distributed publish-subscribe messaging system. This article covers the architecture model, features and characteristics of Kafka framework and how it compares with traditional messaging systems.

Abhishek Sharma
on Jun 04, 2014
Data Modeling in Graph Databases: Interview with Jim Webber and Ian Robinson

Data modeling with Graph databases requires a different paradigm than modeling in Relational or other NoSQL databases like Document databases, Key Value data stores, or Column Family databases. InfoQ spoke with Jim Webber and Ian Robinson about data modeling efforts when using Graph databases.

Srini Penchikala
on May 24, 2014

Newer Articles

Older Articles

InfoQ Software Architects' Newsletter

Articles