BT

Vivint and Cloudera Analyzing Big Data from the Smart Home

by Alex Giamas on  Aug 05, 2014

Vivint recently announced that it is partnering up with Cloudera to analyze more efficiently data coming from Smart Home sensors. The sensors in a residence vary from thermostats to security oriented devices. Analyzing this data centrally, Vivint can provide actionable insights for customers that can provide energy savings. Heating ventilation and cooling(HVAC) accounts for more than 40 percent of

Cloudbreak, New Hadoop as a Service API, Enters Open Beta

by Sergio De Simone on  Jul 22, 2014

Cloudbreak, a new open-source and cloud-agnostic Hadoop as a Service API, is now open for beta access to application developers and enterprises. SequenceIQ, Cloudbreak's maker, claims that its freely available product will make it easier to manage and monitor on-demand Hadoop clusters while also abstracting their provisioning.

Cloudera Acquires Big Data Encryption Startup Gazzang

by Jérôme Serrano on  Jul 15, 2014

Hadoop distributor Cloudera pursued its strategy of securing the Hadoop ecosystem by acquiring last month the big data encryption and key management startup Gazzang. The deal will strengthen Cloudera's security offering and lead to the creation of a center of excellence for Hadoop security that will initially be fueled by Gazzang’s engineering team.

Docker 1.0 Released at DockerCon

by Chris Swan on  Jun 10, 2014

Docker.io have used their inaugural DockerCon event to launch version 1.0 of their container management tools. It comes just days after the release of 0.12.0, which was focussed on stability, performance and usability rather than introducing significant new features. Production readiness means that Docker.io is now providing support services for Docker.

Ayasdi Partners with Cloudera

by Jérôme Serrano on  Jun 06, 2014

Ayasdi announced last month a partnership with Cloudera, the biggest distributor of Apache Hadoop. The partnership that will ensure the compatibility of their solution with Cloudera Enterprise 5, the latest version of Cloudera’s big data platform based on Apache Hadoop.

Splunk's Hunk 6.1 Brings New Capabilities for Big Data Analytics

by Matt Kapilevich on  May 21, 2014

Splunk, a company specializing in searching, monitoring, and analyzing machine-generated data, has announced the release of Hunk 6.1. Hunk provides an analytics platform for big data. The new release also provides streaming resource libraries to connect Hunk to any NoSQL data store, such as Apache Cassandra, MongoDB, and Neo4j.

Community the Focus at ApacheCON NA 2014

by Carlos Sanchez on  May 15, 2014

This year's ApacheCON North America conference saw key speakers focus on open source and its community. With more than 400 attendees, over 70 projects represented and 180 conference sessions it covered as many diverse topics as diverse the Apache Software Foundation projects are.

Cascading 3.0 Adds Multiple Framework Support. Concurrent Driven Manages Big Data Apps

by Boris Lublinsky on  May 13, 2014

Concurrent will release Cascading 3.0 in early summer to allow certain applications to run on multiple Big Data frameworks including MapReduce, Tez, Spark, Storm and others. Additionally, Driven, the new commercial product from Concurrent, provides powerful enterprise data application management for Big Data applications.

Hortonworks Announces Hive 0.13 with Vectorized Query Execution and Hive on Tez

by Matt Kapilevich on  May 13, 2014

Hortonworks announced the release of Hive 0.13 which marks the completion of the Stinger initiative. The new release also includes performance improvements as well as some new SQL features. Hive is an open source SQL Engine written on top of Hadoop that lets users query big data warehouses by writing SQL queries instead of MapReduce jobs.

Cloudera Partners with MongoDB to Store Hadoop Data on Their NoSQL DB

by Abel Avram on  Apr 29, 2014

Starting from the premise that today “80 percent of enterprise data is unstructured and growing at twice the rate of structured data”, Cloudera and MongoDB have announced a “strategic” partnership meant to provide customers the option to combine Cloudera’s Apache-based Big Data platform with MongoDB’s NoSQL solution.

Continuous Development,is it our new maintenance reality?

by Jeevak Kasarkod on  Apr 21, 2014 1

The Internet of Things, Web APIs and Big Data will make continuous development a necessary reality and will tie developers down with maintenance work on completed applications, says Andrew Binstock of Dr. Dobbs. In that case, short sprints, continuous integration and deployment and modern programming practices are even more important to ensure a developer's time is better utilized.

DataBricks Announces Spark SQL for Manipulating Structured Data Using Spark

by Matt Kapilevich on  Apr 19, 2014

DataBricks, the company behind Apache Spark, has announced a new addition into the Spark ecosystem called Spark SQL. Spark SQL is separate from Shark, and does not use Hive under the hood. InfoQ reached out to Reynold Xin and Michael Armbrust, software engineers at DataBricks, to learn more about Spark SQL.

A Roundup of Cloudera Distribution Containing Apache Hadoop 5

by Alex Giamas on  Apr 18, 2014

Cloudera recently released the latest version of its software distribution, CDH5. Almost 20 months after the last major version, CDH4 seems like ages in the Big Data world. We take a look at new features this release brings and the future direction of Cloudera after the latest round of investment from Intel and Google Ventures.

Hydra Takes On Hadoop

by Rags Srinivas on  Apr 11, 2014

The social-networking company AddThis open-sourced Hydra under the Apache version 2.0 License in a recent announcement. Hydra grew from an in-house platform created to process semi-structured social data as live streams and do efficient query processing on those data sets.

Spark Gets a Dedicated Big Data Platform

by Charles Menguy on  Apr 03, 2014

Spark users can now use a new Big Data platform provided by intelligence company Atigeo, which bundles most of the UC Berkeley stack into a unified framework optimized for low-latency data processing that can provide significant improvements over more traditional Hadoop-based platforms.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT