  • Apache Beam Interview with Frances Perry

    by Dylan Raithel on  Jun 20, 2017

    InfoQ interviews Apache Beam's Frances Perry about the impetus for using Beam and the future of the top-level open source project. The interview covers the thinking behind the programming model, as well as some of the touch-points for integration with other data engineering tools such as Apache Spark and Flink.

  • Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included! (Part 2)

    by Mohammad Rezaei on  Jun 13, 2017

    Goldman Sachs is widely known as a leader in investment banking, but it is very much a leading technology firm as well. Continuing our exploration of Reladomo, the primary Java ORM used at GS and now open source, GS Technology Fellow Mohammad Rezaei looks at advanced features such as sharding, caching, bitemporal access, performance, and testing.

  • Machine Learning Techniques for Predictive Maintenance

    by Srinath Perera, Roshan Alwis on  May 21, 2017

    In this article, the authors explore how to build a machine learning model for predictive maintenance of systems. They discuss a sample application that uses a NASA engine failure dataset to predict the Remaining Useful Life (RUL) with regression models.

Predicting Movie Ratings: NLP Tools is What Film Studios Need

Posted by Tatsiana Levdikova on  May 13, 2017

In this article, the author discusses how to use Natural Language Processing (NLP) techniques to predict movie ratings using data shared on social media platforms.

From Alibaba to Apache: RocketMQ’s Past, Present, and Future

Posted by Wang Xiaorui, Feng Jia on  Apr 21, 2017

Feng Jia and Wang Xiaorui share the core distributed systems principles behind RocketMQ, Alibaba's distributed messaging and data streaming platform, now open sourced through the Apache Software Foundation.

Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included!

Posted by Mohammad Rezaei on  Mar 28, 2017

Reladomo, the primary Java ORM used at leading investment bank Goldman Sachs, is now open source. In this article GS Technology Fellow Mohammad Rezaei takes us on a deep dive into Reladomo.

Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX

Posted by Srini Penchikala on  Mar 14, 2017

In this article, the author discusses Apache Spark GraphX, used for graph data processing and analytics, with sample code for graph algorithms like PageRank, Connected Components and Triangle Counting.

Three Experts on Big Data Engineering

Posted by Clemens Szyperski, Martin Petitclerc, Roger Barga on  Mar 12, 2017

Clemens Szyperski (Microsoft), Martin Petitclerc (IBM), and Roger Barga (Amazon Web Services) talk about the challenges of building scalable big data systems and how to address them.

Data Preprocessing vs. Data Wrangling in Machine Learning Projects

Posted by Kai Wähner on  Mar 05, 2017

This article compares alternative techniques for preparing data, including extract-transform-load (ETL) batch processing, streaming ingestion, and data wrangling.

Learning Paths: QCon London Expert Recommendations

Posted by Wesley Reisz on  Feb 16, 2017

Advice on the best talks to attend at QCon London 2017 from London Thought Leaders.

Q&A with Immuta on the Implications of EU’s General Data Protection Regulation (GDPR)

Posted by Manuel Pais on  Feb 10, 2017

InfoQ talked with Immuta’s Andrew Burt and Steve Touw to better understand the implications and challenges of the EU's General Data Protection Regulation, which will come into effect in May 2018.

Cassandra: The Definitive Guide, 2nd Edition Book Review and Interview

Posted by Srini Penchikala on  Jan 05, 2017

Cassandra: The Definitive Guide, 2nd Edition, by Jeff Carpenter and Eben Hewitt, covers version 3.0 of the Cassandra NoSQL database. InfoQ spoke with co-author Jeff Carpenter.
