BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

Older rss
  • Data Science Follow 126 Followers

    Apache Beam Interview with Frances Perry

    by Dylan Raithel Follow 2 Followers on  Jun 20, 2017

    InfoQ Interviews Apache Beam's Frances Perry about the impetus for using Beam and the future of the top-level open source project and covers the thoughts behind the programming model as well as some of the touch-points in integration with other data engineering tools like Apache Spark and Flink.

  • Data Science Follow 126 Followers

    Introducing FaunaDB Serverless Cloud

    by Matt Freels Follow 1 Followers on  Jun 14, 2017 1

    FaunaDB Serverless Cloud is the managed version of FaunaDB, a serverless, object-relational, globally replicated, strongly consistent, temporal database, that can be deployed on multiple clouds, such as AWS, GCP, and Azure, or on premises.

  • Java Follow 114 Followers

    Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included! (Part 2)

    by Mohammad Rezaei Follow 0 Followers on  Jun 13, 2017

    Goldman Sachs is widely known as a leader in investment banking, but they are very much a leading technology firm as well. Continuing our exploration of Reladomo, the primary Java ORM used at GS and now open source, GS Technology Fellow, Mohammad Rezaei looks at advanced features, such as sharding, caching, bitemporal access, performance, and testing.

Data Science Follow 126 Followers

Machine Learning Techniques for Predictive Maintenance

Posted by Srinath Perera Follow 0 Followers , Roshan Alwis Follow 0 Followers on  May 21, 2017

In this article, the authors explore how we can build a machine learning model to do predictive maintenance of systems using NASA engine failure dataset.

Data Science Follow 126 Followers

Predicting Movie Ratings: NLP Tools is What Film Studios Need

Posted by Tatsiana Levdikova Follow 0 Followers on  May 13, 2017

In this article, the author discusses how to use Natural Language Processing (NLP) techniques to predict the movie ratings using the data shared on social media platforms.

Data Science Follow 126 Followers

Pascal Desmarets on NoSQL Data Modeling Best Practices

Posted by Srini Penchikala Follow 6 Followers on  May 01, 2017

NoSQL databases are designed to store different types of data like Key Value, Documents, Time Series, Graph & IoT. Pascal Desmarets talks about how to do data modeling when using NoSQL databases. 1

Data Science Follow 126 Followers

From Alibaba to Apache: RocketMQ’s Past, Present, and Future

Posted by Wang Xiaorui Follow 0 Followers , Feng Jia Follow 0 Followers on  Apr 21, 2017

Feng Jia and Wang Xiaorui share the core distributed systems principals behind RocketMQ, Alibaba's distributed messaging and data streaming platform now open sourced through the Apache Foundation.

Data Science Follow 126 Followers

Building Pipelines for Heterogeneous Execution Environments for Big Data Processing

Posted by Dongyao Wu Follow 0 Followers , Liming Zhu Follow 0 Followers , Xiwei Xu Follow 0 Followers , Sherif Sakr Follow 0 Followers , Daniel Sun Follow 0 Followers , Qinghua Lu Follow 0 Followers on  Mar 31, 2017

The Pipeline61 framework supports the building of data pipelines involving heterogeneous execution environments. It reuses the existing code of the deployed jobs in different environments.

Java Follow 114 Followers

Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included!

Posted by Mohammad Rezaei Follow 0 Followers on  Mar 28, 2017

Reladomo, the primary Java ORM used at leading investment bank Goldman Sachs, is now open source. In this article GS Technology Fellow Mohammad Rezaei takes us on a deep dive into Reladomo. 5

Data Science Follow 126 Followers

Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX

Posted by Srini Penchikala Follow 6 Followers on  Mar 14, 2017

In this article, author discusses Apache Spark GraphX used for graph data processing and analytics, with sample code for graph algorithms like PageRank, Connected Components and Triangle Counting. 2

Data Science Follow 126 Followers

Three Experts on Big Data Engineering

Posted by Clemens Szyperski Follow 0 Followers , Martin Petitclerc Follow 0 Followers , Roger Barga Follow 0 Followers on  Mar 12, 2017

Clemens Szyperski (Microsoft), Martin Petitclerc (IBM), and Roger Barga (Amazon Web Services) talk about challenges when building scalable, big data systems, and how to address them.

Data Science Follow 126 Followers

Data Preprocessing vs. Data Wrangling in Machine Learning Projects

Posted by Kai Wähner Follow 0 Followers on  Mar 05, 2017

This article compares different alternative techniques to prepare data, including extract-transform-load (ETL) batch processing, streaming ingestion and data wrangling.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT