• The Role of a Data Scientist in 2016

    by Ed Jones on  Mar 27, 2016

    The Data Scientist role has been getting a lot of attention lately as organizations start to use big data processing and analytics techniques to gain insights into their data. This post takes a closer look at the role of a Data Scientist in 2016.

  • Spark in Action Book Review & Interview

    by Srini Penchikala on  Mar 18, 2016

    In the "Spark in Action" book, authors Petar Zecevic and Marko Bonaci discuss the Apache Spark framework for data processing (batch and streaming data use cases). They introduce the architecture of Spark and core concepts such as Resilient Distributed Datasets (RDDs). InfoQ spoke with them about Apache Spark, developer tools, and the features and enhancements planned for future releases.

  • Unified Data Modeling for Relational and NoSQL Databases

    by Allen Wang on  Feb 28, 2016

    Current enterprise data architectures include NoSQL databases co-existing with relational databases. However, mature methods and tools for managing NoSQL data are still lacking. In this article, the author discusses a solution for managing both NoSQL and relational databases using Unified Data Modeling techniques.

Getting Ready for IoT’s Big Data Challenges with Couchbase Mobile

Posted by Ralph Winzinger on  Jan 20, 2016

Our physical world is about to become digitally enabled and, according to various predictions, many billions of IoT devices will go online and start collecting data in the coming years.

Big Data Processing with Apache Spark - Part 3: Spark Streaming

Posted by Srini Penchikala on  Jan 07, 2016

In this article, the third installment of the Apache Spark series, the author discusses the Apache Spark Streaming framework for processing real-time streaming data, using a log analytics sample application.

Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library

Posted by Konur Unyelioglu on  Dec 22, 2015

In this article, the author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using Apache Spark's MLlib Java API.

Data Lake-as-a-Service: Big Data Processing and Analytics in the Cloud

Posted by Srini Penchikala on  Dec 10, 2015

Data Lake-as-a-Service provides big data processing in the cloud for business outcomes in a cost-effective way. InfoQ spoke with Lovan Chetty & Hannah Smalltree from Cazena about how these solutions work.

Real-time Data Processing in AWS Cloud

Posted by Oleksii Tymchenko on  Nov 11, 2015

In this article, the author discusses a bioinformatics software as a service (SaaS) product that was built as a public data warehousing and analytical platform for mass spectrometry data.

Oozie Plugin for Eclipse

Posted by Ahmed Mahran on  Oct 30, 2015

A new Eclipse Oozie plugin significantly simplifies the implementation of Oozie processes by allowing them to be defined graphically. The article introduces the plugin and provides an example of its usage.

Big Data Solutions with MS SQL ColumnStore Index

Posted by Aleksandr Shavlyuga on  Oct 11, 2015

ColumnStore indexes can offer performance improvements over traditional tables, but they aren’t always faster. Aleksandr Shavlyuga explores the power and limitations of SQL Server’s ColumnStore Indexes.

The Estimation Game - Techniques for Informed Guessing

Posted by Carlos Bueno on  Sep 26, 2015

In this article, author Carlos Bueno discusses strategies for estimating server capacity for big data projects and initiatives, with the help of two case studies.

Machine Learning and Cognitive Computing

Posted by Seth Earley on  Sep 19, 2015

This article covers machine learning and cognitive computing, and how they are related to artificial intelligence (AI). Panelists discuss how this technology is applied in the digital marketing space.

Marketing and all content copyright © 2006-2016 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.