BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

  • Data Science Follow 278 Followers

    Big Data Analytics with Spark Book Review and Interview

    by Srini Penchikala Follow 13 Followers on  Jun 23, 2016

    Big Data Analytics with Spark book, authored by Mohammed Guller, provides a practical guide for learning Apache Spark framework for different types of big-data analytics projects, including batch, interactive, graph, and stream data analysis as well as machine learning. InfoQ spoke with author about the book & development tools for big data applications.

  • Data Science Follow 278 Followers

    Big Data Processing with Apache Spark - Part 4: Spark Machine Learning

    by Srini Penchikala Follow 13 Followers on  May 15, 2016

    In this fourth installment of Apache Spark article series, author Srini Penchikala discusses machine learning concepts and Spark MLlib library for running predictive analytics using a sample application.

  • Data Science Follow 278 Followers

    The Role of a Data Scientist in 2016

    by Ed Jones Follow 0 Followers on  Mar 27, 2016

    Data Scientist role has been getting lot of attention lately as organizations are starting to use big data processing and analytics techniques to gain insights into their data. This post takes a closer look at the role of a Data Scientist in 2016.

Data Science Follow 278 Followers

Unified Data Modeling for Relational and NoSQL Databases

Posted by Allen Wang Follow 0 Followers on  Feb 28, 2016

Current enterprise data architectures include NoSQL databases co-existing with RDBMS. In this article, author discusses a solution for managing NoSQL & relational data using unified data modeling. 5

Mobile Follow 52 Followers

Getting Ready for IoT’s Big Data Challenges with Couchbase Mobile

Posted by Ralph Winzinger Follow 0 Followers on  Jan 20, 2016

Our physical world is about to become digitally enabled and according to various predictions, there will be many billions of IoT devices going online and collecting data in the coming years. 2

Data Science Follow 278 Followers

Big Data Processing with Apache Spark - Part 3: Spark Streaming

Posted by Srini Penchikala Follow 13 Followers on  Jan 07, 2016

In this article, third installment of Apache Spark series, author discusses Apache Spark Streaming framework for processing real-time streaming data using a log analytics sample application. 7

Data Science Follow 278 Followers

Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library

Posted by Konur Unyelioglu Follow 0 Followers on  Dec 22, 2015

In this article, author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using the Apache Spark's MLlib Java API.

Data Science Follow 278 Followers

Data Lake-as-a-Service: Big Data Processing and Analytics in the Cloud

Posted by Srini Penchikala Follow 13 Followers on  Dec 10, 2015

Data Lake-as-a-Service provides big data processing in the cloud for business outcomes in a cost effective way. InfoQ spoke with Lovan Chetty & Hannah Smalltree from Cazena about these solutions work.

Big Data Follow 32 Followers

Real-time Data Processing in AWS Cloud

Posted by Oleksii Tymchenko Follow 0 Followers on  Nov 11, 2015

In this article, author discusses a bio-informatic software as a service (SaaS) product which was built as a public data warehousing and analytical platform for mass spectrometry data. 3

Big Data Follow 32 Followers

Oozie Plugin for Eclipse

Posted by Ahmed Mahran Follow 0 Followers on  Oct 30, 2015

A new Eclipse Oozie plugin allows to significantly simplify implementation of Oozie processes by allowing to define them graphically. An article introduces plugin and provides an example of its usage. 1

Big Data Follow 32 Followers

Big Data Solutions with MS SQL ColumnStore Index

Posted by Aleksandr Shavlyuga Follow 0 Followers on  Oct 11, 2015

ColumnarStore can offer performance improvements over traditional tables, but aren’t always faster. Aleksandr Shavlyuga explores the power, and limitations of SQL Server’s ColumnStore Indexes.

Big Data Follow 32 Followers

The Estimation Game - Techniques for Informed Guessing

Posted by Carlos Bueno Follow 0 Followers on  Sep 26, 2015

In this article, author Carlos Bueno discusses the strategies for estimating the server capacity for big data projects and initiatives, with the help of two case studies.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT