
Splice Machine Version 1.0 Supports Integration with Hadoop and Analytic Window Functions

by Srini Penchikala on  Dec 18, 2014

Splice Machine version 1.0 supports analytic window functions and integration with the Hadoop ecosystem. The Splice Machine team recently released its Hadoop-based RDBMS, a data management solution that can be used for transactional workloads on Hadoop.

LinkedIn Open Sources Cubert With an Eye To Big Data Analytics

by Alex Giamas on  Dec 17, 2014

LinkedIn recently open sourced Cubert, its High Performance Computation Engine for Complex Big Data Analytics. Cubert is a framework written with analysts and data scientists in mind. Developed entirely in Java and exposed as a scripting language, Cubert is designed for the complex joins and aggregations that frequently arise in reporting.

Gobblin, LinkedIn's Unified Data Ingestion Platform

by Mikio Braun on  Dec 15, 2014

At the 2014 QCon San Francisco conference, LinkedIn's Lin Qiao gave a talk on Gobblin (also summarized in a blog post), the company's unified data ingestion system for internal and external data sources.

Stripe Open Sources Tools For Apache Hadoop

by Alex Giamas on  Dec 09, 2014

Stripe, the internet payments infrastructure company, recently announced that it is open sourcing a set of internally developed tools based on Apache Hadoop. Timberlake, Brushfire, Sequins and Herringbone all add to the tools available for building an Apache Hadoop stack.

Spark Sets New Record in Sort Performance

by Benjamin Darfler on  Nov 26, 2014

Databricks recently announced a new record in the Daytona GraySort contest using the Spark processing engine. The Daytona GraySort contest is a third-party benchmark that measures how fast a system can sort 100 terabytes of data. Databricks posted a throughput of 4.27 TB/min on a cluster of 206 machines for its official run.

Hortonworks Data Platform Makes an Enterprise Push

by Rags Srinivas on  Nov 14, 2014

Hortonworks Data Platform (HDP) version 2.2, with features built around Hadoop and YARN, improves support for enterprise requirements such as security and compliance.

Microsoft Expands Azure Machine Learning and Real Time Analytics Offering

by Alex Giamas on  Oct 31, 2014

Microsoft recently announced new machine learning capabilities for the Microsoft Azure platform. Developers can also create their own web services and publish them to the Azure Marketplace. Microsoft also announced the availability of Apache Storm for Azure. Azure Stream Analytics, Data Factory and Event Hubs for Azure were all announced by Microsoft in the past few weeks. In this article we explore more about

Big Data Analytics: Using Hunk with Hadoop and Elastic MapReduce

by Jonathan Allen on  Oct 07, 2014

Hunk is a relatively new product from Splunk for exploring and visualizing Hadoop and other NoSQL data stores. New in this release is support for Amazon’s Elastic MapReduce.

Apache Drill Included in MapR Latest Distribution Release

by Alex Giamas on  Sep 30, 2014

MapR recently announced that it is including Apache Drill in the latest release of the MapR distribution. Apache Drill is the open source version of Google’s Dremel, the infrastructure on which BigQuery is based. Drill offers a low-latency SQL-on-Hadoop interface. While this puts it in the same space as several other technologies around Hadoop, Drill has some unique characteristics setting it

Hortonworks Announces Stinger.next Roadmap to Deliver Hadoop Scale SQL with Apache Hive

by Adam Berry on  Sep 25, 2014

Following on from the Stinger initiative delivered in Apache Hive 0.13, Hortonworks has laid out the Stinger.next roadmap to provide full ACID transactions, a sub-second query engine, and more complete SQL:2011 analytics support, all driving towards the goal of “enhancing the speed, scale and breadth of SQL support” in Hive.

Data Encryption in Apache Hadoop with Project Rhino - Q&A with Steven Ross

by Abhishek Sharma on  Aug 14, 2014

Cloudera recently released an update on Project Rhino and data-at-rest encryption in Apache Hadoop. Project Rhino is an effort by Cloudera, Intel and the Hadoop community to build a comprehensive security framework for data protection. InfoQ recently talked to Steven Ross from the Cloudera team to learn more about the project.

Cloudbreak, New Hadoop as a Service API, Enters Open Beta

by Sergio De Simone on  Jul 22, 2014

Cloudbreak, a new open-source and cloud-agnostic Hadoop as a Service API, is now open for beta access to application developers and enterprises. SequenceIQ, Cloudbreak's maker, claims that its freely available product will make it easier to manage and monitor on-demand Hadoop clusters while also abstracting their provisioning.

Cloudera Acquires Big Data Encryption Startup Gazzang

by Jérôme Serrano on  Jul 15, 2014

Hadoop distributor Cloudera continued its strategy of securing the Hadoop ecosystem by acquiring the big data encryption and key management startup Gazzang last month. The deal will strengthen Cloudera's security offering and lead to the creation of a center of excellence for Hadoop security that will initially be staffed by Gazzang’s engineering team.

Hortonworks Acquires XA Secure to Strengthen Security in Enterprise Hadoop

by Abhishek Sharma on  Jun 23, 2014

Hortonworks recently acquired the data security company XA Secure to help it provide comprehensive security for the Hortonworks Data Platform (HDP). Security features will be available across all Hadoop workloads, from batch and interactive SQL to real-time.

Hadoop Summit 2014 Day Two - On the Path to Enterprise Grade Hadoop

by Jeevak Kasarkod on  Jun 05, 2014

This Hadoop Summit Day Two report covers the important trends and changes since last year's summit, along with the day's major announcements as they relate to this year's trending topics. It also shares an analysis of the Hadoop market by leading analysts, competing vendor benchmarks, and platform-specific innovations and announcements.

InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.