BT

Apache Flink 1.0.0 is Released

by Rags Srinivas on  Mar 24, 2016

InfoQ's Rags Srinivas caught up with Stephan Ewen, a project committer for Apache Flink about the 1.0.0 Release and the roadmap

Databricks Integrates Spark and TensorFlow for Deep Learning

by Dylan Raithel on  Mar 12, 2016

Since announcements late last year about Google open-sourcing TensorFlow, the company’s open-source library for machine learning, and previous coverage at InfoQ, the data-science community has had an opportunity to try out TensorFlow for their own projects.

Funnel Analysis at Twitter for Improving User Engagement

by Srini Penchikala on  Feb 25, 2016

Funnel analysis is used to analyze a sequence of events to help with user engagement on a website or a mobile application. Data Science team at Twitter uses this concept to learn how users interact with user interfaces during sign up or tweeting for improving user engagement with Twitter.

AlphaGo: Google and DeepMind Publish Seminal AI Work

by Dylan Raithel on  Feb 23, 2016

A game simulation at Google's Deep Mind defeated expert humans at Go last month in a breakthrough for AI. Go is considered one of the great unsolved problems in AI.

Benchmarking Netflix Dynomite with Redis on AWS

by Alex Giamas on  Feb 03, 2016

Last year, Netflix Cloud Database Engineering (CDE) team introduced Dynomite. Dynomite is a proxy layer, aiming to turn any non-distributed database into a sharded, multi-region replication aware distributed database system. Now Netflix released a benchmark using Dynomite with Redis in AWS infrastructure.

How Airbnb Uses Net Promoter Score to Predict Guest Rebooking

by Srini Penchikala on  Feb 02, 2016 1

Net Promoter Score (NPS) is a customer loyalty metric used to determine the likelihood that a customer will return to a company's website or use their service again. Airbnb uses NPS extensively in measuring the customer loyalty, as a more effective measurement to determine the likelihood that a customer will return to book again or recommend the company to their friends.

Hazelcast Version 3.6 Features Performance Improvements and Cloud Management

by Victor Grazi on  Jan 25, 2016

Hazelcast has released version 3.6 of their flagship in-memory grid and caching software, featuring numerous performance improvements and new cloud management and container deployment options.

Yahoo Open-Sources DataSketches for Faster Operations Over Streams

by Abraham Marín Pérez on  Jan 20, 2016

Yahoo has open-sourced DataSketches, a library written in Java for stochastic streaming algorithms. DataSketches is able to perform traditionally expensive operations, like counting distinct occurrences of a variable within a stream, using a fraction of time and memory and with a predictable error margin.

Riley Newman on How Airbnb Uses Data Science

by Jérôme Serrano on  Jan 10, 2016

Riley Newman, head of data science at Airbnb, recently published an article describing how the Californian startup defines and uses data science. He explains that data can be seen as the voice of the customers, and data science as an act of interpretation. He also details several initiatives that have been particularly important for scaling data science.

Yahoo! Benchmarks Apache Flink, Spark and Storm

by Abel Avram on  Dec 23, 2015

Yahoo! has benchmarked three of the main stream processing frameworks: Apache Flink, Spark and Storm.

MongoDB Hits 3.2 and Becomes Enterprise Ready

by Alex Giamas on  Nov 25, 2015

MongoDB recently announced the newest version of its NoSQL database synonymous product. Building upon the new features introduced in 3.0 release, 3.2 is expanding and solidifying MongoDB’s interest towards the corporate world.

IBM Commits to Advance Apache Spark

by Alex Giamas on  Nov 20, 2015

Earlier last month in Las Vegas, at IBM Insight 2015, IBM announced a major commitment to the Apache Spark project. Referring to it as “potentially the most significant open source project of the next decade” tells a lot about how important IBM believes Apache Spark is. With IDC reporting that 80% of cloud applications in the future will be data intensive, Apache Spark can unlock previously...

DMTK, a Machine Learning Toolkit from Microsoft

by Abel Avram on  Nov 13, 2015

About the same time Google announced open sourcing TensorFlow, Microsoft has pushed to GitHub DMTK, a Distributed Machine Learning Toolkit. While Google has released a one-machine version of TensorFlow, DMTK runs on a cluster of machines.

TensorFlow: Google Open Sources Their Machine Learning Tool

by Abel Avram on  Nov 09, 2015

TensorFlow is a machine learning library created by the Brain Team researchers at Google and now open sourced under the Apache License 2.0. TensorFlow is detailed in the whitepaper TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. The source code can be found on Google Git.

Teradata Announces New Software for Real-Time Analysis of Internet of Things Data

by Kevin Farnham on  Nov 06, 2015 1

At its 2015 Partners User Group Conference, Teradata announced two new software capabilities for real-time ingestion and analysis of massive streams of IoT data. While the Teradata Listener software enables "listening" to multiple, diverse IoT data streams in real time, the new Teradata Aster Analytics on Hadoop software provides scalable analysis of massive IoT data streams.

General Feedback
Bugs
Advertising
Editorial
Marketing
InfoQ.com and all content copyright © 2006-2016 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT

We notice you’re using an ad blocker

We understand why you use ad blockers. However to keep InfoQ free we need your support. InfoQ will not provide your data to third parties without individual opt-in consent. We only work with advertisers relevant to our readers. Please consider whitelisting us.