Big Data Content on InfoQ

News

AI, ML & Data Engineering

Benchmarking Netflix Dynomite with Redis on AWS

Last year, the Netflix Cloud Database Engineering (CDE) team introduced Dynomite, a proxy layer that aims to turn any non-distributed database into a sharded, multi-region, replication-aware distributed database system. Netflix has now released a benchmark of Dynomite with Redis on AWS infrastructure.

Alex Giamas
on Feb 03, 2016
AI, ML & Data Engineering

How Airbnb Uses Net Promoter Score to Predict Guest Rebooking

Net Promoter Score (NPS) is a customer loyalty metric used to gauge the likelihood that a customer will return to a company's website or use its service again. Airbnb uses NPS extensively to measure customer loyalty, treating it as a more effective indicator of whether a guest will book again or recommend the company to friends.

Srini Penchikala
on Feb 02, 2016
Java

Yahoo Open-Sources DataSketches for Faster Operations Over Streams

Yahoo has open-sourced DataSketches, a library written in Java for stochastic streaming algorithms. DataSketches can perform traditionally expensive operations, such as counting distinct occurrences of a variable within a stream, using a fraction of the time and memory, with a predictable error margin.

Abraham Marín Pérez
on Jan 20, 2016
AI, ML & Data Engineering

Riley Newman on How Airbnb Uses Data Science

Riley Newman, head of data science at Airbnb, recently published an article describing how the Californian startup defines and uses data science. He explains that data can be seen as the voice of the customers, and data science as an act of interpretation. He also details several initiatives that have been particularly important for scaling data science.

Jérôme Serrano
on Jan 10, 2016
AI, ML & Data Engineering

MongoDB Hits 3.2 and Becomes Enterprise Ready

MongoDB recently announced the newest version of its eponymous NoSQL database. Building on the features introduced in the 3.0 release, 3.2 expands and solidifies MongoDB’s focus on the corporate world.

Alex Giamas
on Nov 25, 2015
AI, ML & Data Engineering

IBM Commits to Advance Apache Spark

Earlier last month in Las Vegas, at IBM Insight 2015, IBM announced a major commitment to the Apache Spark project. That IBM refers to it as “potentially the most significant open source project of the next decade” says a lot about how important the company believes Apache Spark is. With IDC reporting that 80% of cloud applications in the future will be data intensive, Apache Spark can unlock previously...

Alex Giamas
on Nov 20, 2015
AI, ML & Data Engineering

DMTK, a Machine Learning Toolkit from Microsoft

At about the same time that Google announced it was open sourcing TensorFlow, Microsoft pushed DMTK, a Distributed Machine Learning Toolkit, to GitHub. While Google has released a single-machine version of TensorFlow, DMTK runs on a cluster of machines.

Abel Avram
on Nov 13, 2015
AI, ML & Data Engineering

TensorFlow: Google Open Sources Their Machine Learning Tool

TensorFlow is a machine learning library created by the Brain Team researchers at Google and now open sourced under the Apache License 2.0. TensorFlow is detailed in the whitepaper TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. The source code can be found on Google Git.

Abel Avram
on Nov 09, 2015
Development

Teradata Announces New Software for Real-Time Analysis of Internet of Things Data

At its 2015 Partners User Group Conference, Teradata announced two new software capabilities for real-time ingestion and analysis of massive streams of IoT data. While the Teradata Listener software enables "listening" to multiple, diverse IoT data streams in real time, the new Teradata Aster Analytics on Hadoop software provides scalable analysis of massive IoT data streams.

Kevin Farnham
on Nov 06, 2015
AI, ML & Data Engineering

DistributedLog at Twitter for High Performance Logging

Twitter uses replicated logs for high-performance data collection and analysis of its systems, and DistributedLog is the system it developed for this purpose. Twitter has also developed Manhattan, a distributed key-value database that can trade consistency for read latency under an eventually consistent data model. We examine Twitter's design and tradeoffs for DistributedLog.

Alex Giamas
on Oct 20, 2015
AI, ML & Data Engineering

Amazon Announces QuickSight - Business Intelligence for Big Data on AWS

Amazon has announced QuickSight at the AWS re:Invent conference. QuickSight is a complete Business Intelligence solution that helps customers gain insights from the data they have stored in AWS.

Matt Kapilevich
on Oct 09, 2015
Cloud

Salesforce Enters IoT Market

At Salesforce’s recent Dreamforce conference, the company announced an upcoming IoT platform that will ingest real-time data and turn it into actionable tasks across its suite of cloud-based services.

Kent Weare
on Oct 01, 2015
Architecture & Design

Hortonworks Addresses the IoAT with DataFlow Based on NiFi

Hortonworks has quietly made available the DataFlow platform, which is based on Apache NiFi and attempts to address the processing needs of the Internet of Anything (IoAT).

Abel Avram
on Sep 25, 2015
AI, ML & Data Engineering

SpringXD being Re-architected and Re-branded to Spring Cloud Data Flow

Pivotal announced a complete re-design of Spring XD, its big data offering, during last week’s SpringOne2GX conference, with a corresponding re-brand from Spring XD to Spring Cloud Data Flow. The new product is focussed on orchestration.

Charles Humble
on Sep 25, 2015
AI, ML & Data Engineering

Splunk for DBAs

The DBA’s primary job is to ensure that the business’s information is always available, with performance coming in a close second. We’ve already talked about optimizing distributed queries in Splunk and map-reduce queries in Hunk. In this report we expand on that with more of what a DBA needs to know about Splunk databases.

Jonathan Allen
on Sep 24, 2015
