• AI, ML & Data Engineering Follow 954 Followers

    Columnar Databases and Vectorization

    by Siddharth Teotia Follow 0 Followers on  May 27, 2018

    In this article, author Siddharth Teotia discusses the Dremio database which is based on Apache Arrow with vectorization capabilities.

  • Culture & Methods Follow 759 Followers

    Q&A on the Book Software Wasteland

    by Ben Linders Follow 27 Followers on  May 07, 2018

    Almost all Enterprise Information Systems now cost vastly more to implement than they should. When you have hundreds or thousands of complex applications, you are stuck in the Application Centric Quagmire. In the book Software Wasteland Dave McComb explores what is causing application development waste and how visualizing the cost of change and becoming data-centric can help to reduce the waste.

  • DevOps Follow 919 Followers

    What Do Data Scientists and Data Engineers Need to Know about GDPR?

    by Andrew Burt Follow 0 Followers on  Jan 27, 2018 3

    Andrew Burt on the implications of GDPR on data collection, storage and use for any organization dealing with customer data in the EU. Burt explains what's the minimum an org needs to pass the GDPR test, as well as how to take the opportunity to improve their overall data governance.

AI, ML & Data Engineering Follow 954 Followers

Big Data Processing with Apache Spark – Part 1: Introduction

Posted by Srini Penchikala Follow 36 Followers on  Jan 30, 2015

Apache Spark is an open source big data framework built around speed, ease of use, and sophisticated analytics. In this article, Srini Penchikala discusses how Spark helps with big data processing. 8


Improving Data Management with the DMM

Posted by Ben Linders Follow 27 Followers on  Sep 15, 2014

The CMMI Institute has launched the Data Management Maturity (DMM)SM model. It can be used to improve data management, helping organizations to bridge the gap between business and IT.


Cindy Walker on Data Management Best Practices and Data Analytics Center of Excellence

Posted by Srini Penchikala Follow 36 Followers on  Jul 13, 2014

Cindy Walker spoke at Enterprise Data World Conference about using semantic approaches to augment data management practices. InfoQ spoke with her about these best practices and data analytics.


Interview and Book Review: NoSQL Distilled

Posted by Srini Penchikala Follow 36 Followers on  Nov 29, 2012

InfoQ spoke with NoSQL Distilled book authors, Pramod Sadalage and Martin Fowler about NoSQL database space and the emerging trends in NoSQL.

Login to InfoQ to interact with what matters most to you.

Recover your password...


Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.


More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.


Stay up-to-date

Set up your notifications and don't miss out on content that matters to you