  • Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines

    by Dylan Raithel on  Aug 29, 2016

    InfoQ interviews Chris Fregly, organizer of the 4000+ member Advanced Spark and TensorFlow Meetup, about the PANCAKE STACK workshop, Spark, and building data pipelines for machine learning.

  • Christine Doig on Data Science as a Team Discipline

    by Srini Penchikala on  Aug 26, 2016

    Christine Doig spoke at this year's OSCON Conference about data science as a team discipline and how to navigate the data science Python ecosystem. InfoQ spoke with Christine about challenges data science teams need to address to be more effective.

  • Book Review and Excerpt: Infrastructure as Code

    by Abel Avram on  Jul 25, 2016

    In this article we review the book Infrastructure as Code - Managing Servers in the Cloud written by Kief Morris, who is leading Continuous Delivery and DevOps at ThoughtWorks Europe. In over 300 pages, Morris lays down the foundation for Infrastructure as Code and outlines the main patterns and practices recommended for building it.

Big Data Analytics with Spark Book Review and Interview

Posted by Srini Penchikala on  Jun 23, 2016

Big Data Analytics with Spark, authored by Mohammed Guller, provides a practical guide for learning Apache Spark. InfoQ and the author discuss the book and development tools for big data applications.

Big Data Processing with Apache Spark - Part 4: Spark Machine Learning

Posted by Srini Penchikala on  May 15, 2016

In this fourth installment of the Apache Spark article series, author Srini Penchikala discusses machine learning concepts and the Spark MLlib library for running predictive analytics, using a sample application.

The Holistic Approach: Preventing Software Disasters

Posted by Olivier Bonsignour on  Apr 28, 2016

Olivier Bonsignour on what "X-Raying" software means, how it can help prevent software disasters and why CIOs should care.

The Role of a Data Scientist in 2016

Posted by Ed Jones on  Mar 27, 2016

Data Science has been getting a lot of attention as organizations start to use data analytics to gain insights into their data. This article takes a closer look at the Data Scientist role in 2016.

Unified Data Modeling for Relational and NoSQL Databases

Posted by Allen Wang on  Feb 28, 2016

Current enterprise data architectures include NoSQL databases co-existing with RDBMS. In this article, the author discusses a solution for managing NoSQL and relational data using unified data modeling.

Sourcing Security Superheroes: Part II: How Policy Can Enhance, Rather Than Hinder, Breach Detection

Posted by Monzy Merza on  Feb 11, 2016

In theory, security policies protect organizations, stakeholders, and users. But in practice, organizations become more concerned with meeting these standards than protecting the business.

Getting Ready for IoT’s Big Data Challenges with Couchbase Mobile

Posted by Ralph Winzinger on  Jan 20, 2016

Our physical world is about to become digitally enabled, and according to various predictions, many billions of IoT devices will go online and collect data in the coming years.

Big Data Processing with Apache Spark - Part 3: Spark Streaming

Posted by Srini Penchikala on  Jan 07, 2016

In this article, the third installment of the Apache Spark series, the author discusses the Apache Spark Streaming framework for processing real-time streaming data, using a log analytics sample application.

Health Informatics and Survival Prediction of Cancer with Apache Spark Machine Learning Library

Posted by Konur Unyelioglu on  Dec 22, 2015

In this article, the author discusses the survival prediction of colorectal cancer as a multi-class classification problem and how to solve that problem using Apache Spark's MLlib Java API.
