BT

Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ

Topics

Choose your language

InfoQ Homepage Cloudera Content on InfoQ

  • Cloudera Announces the General Availability of Cloudera DataFlow for the Public Cloud

    The enterprise data cloud company Cloudera recently announced the general availability (GA) of Cloudera DataFlow for the Public Cloud, a cloud-native service for data flows to process hybrid streaming workloads on the Cloudera Data Platform (CDP).

  • Google Introduces Cloud Storage Connector for Hadoop Big Data Workloads

    In a recent blog post, Google announced a new Cloud Storage connector for Hadoop. This new capability allows organizations to substitute their traditional HDFS with Google Cloud Storage. Columnar file formats such as Parquet and ORC may realize increased throughput, and customers will benefit from Cloud Storage directory isolation, lower latency, increased parallelization and intelligent defaults

  • Cloudera and Hortonworks Merge with Goal to Increase Competition with Cloud Offerings

    Earlier this month, Cloudera and Hortonworks announced an all-stock merger at a combined value of around $5.2 billion. Analysts have argued that this merger is aimed at increased competition that both companies are facing from cloud vendors like Amazon, Google and Microsoft. In this article we log reactions from analysts and the industry, and the implications for current customers.

  • Adatao Launches Full Stack Data Intelligence Platform

    Adatao recently announced the general availability of its Data Intelligence platform. Its platform aims to make data analysis and predictive analytics available to everyone in large organizations. Adatao had secured an investment of $13 million last year from a group of investors including Bloomberg Beta, Lightspeed Venture Partners and Andreessen Horowitz.

  • Stripe Open Sources Tools For Apache Hadoop

    Stripe, the internet payments infrastructure company recently announced open sourcing a set of internally developed tools based on Apache Hadoop.Timberlake, Brushfire, Sequins and Herringbone all contribute to enriching the available tools for building an Apache Hadoop stack.

  • Apache Drill Included in MapR Latest Distribution Release

    MapR recently announced including Apache Drill in its latest release of MapR distribution. Apache Drill is the open source version of Google’s Dremel. Dremel is the infrastructure on which BigQuery is based upon. Drill is offering a low latency SQL-on-Hadoop interface. While this puts it in the same space as several other technologies around Hadoop, Drill has some unique characteristics setting it

  • Vivint and Cloudera Analyzing Big Data from the Smart Home

    Vivint recently announced that it is partnering up with Cloudera to analyze more efficiently data coming from Smart Home sensors. The sensors in a residence vary from thermostats to security oriented devices. Analyzing this data centrally, Vivint can provide actionable insights for customers that can provide energy savings. Heating ventilation and cooling(HVAC) accounts for more than 40 percent of

  • Cloudera Acquires Big Data Encryption Startup Gazzang

    Hadoop distributor Cloudera pursued its strategy of securing the Hadoop ecosystem by acquiring last month the big data encryption and key management startup Gazzang. The deal will strengthen Cloudera's security offering and lead to the creation of a center of excellence for Hadoop security that will initially be fueled by Gazzang’s engineering team.

  • Hadoop Summit 2014 Day Two - On the Path to Enterprise Grade Hadoop

    Hadoop Summit Day Two report covers the important trends and changes from last year's summit. It also covers the important announcements of the day in relation to this year's trending topics. This report shares an analysis of the Hadoop market by leading analysts, competing benchmarks by vendors and platform specific innovations and announcements.

  • Cloudera Partners with MongoDB to Store Hadoop Data on Their NoSQL DB

    Starting from the premise that today “80 percent of enterprise data is unstructured and growing at twice the rate of structured data”, Cloudera and MongoDB have announced a “strategic” partnership meant to provide customers the option to combine Cloudera’s Apache-based Big Data platform with MongoDB’s NoSQL solution.

  • A Roundup of Cloudera Distribution Containing Apache Hadoop 5

    Cloudera recently released the latest version of its software distribution, CDH5. Almost 20 months after the last major version, CDH4 seems like ages in the Big Data world. We take a look at new features this release brings and the future direction of Cloudera after the latest round of investment from Intel and Google Ventures.

  • Interactive SQL in Apache Hadoop with Impala and Hive

    In the race for interactive SQL in Big Data environments, there are two open source based front-runners, Impala and Hive with the Stinger project. Cloudera recently announced that Impala is up to 69 times faster than Hive 0.12 and can outperform DBMS. Other than raw speed, we take a look at other considerations in choosing a SQL engine for Hadoop and also Tez, an application framework for YARN.

  • Spark, Storm and Real Time Analytics

    Hadoop is definitely the platform of choice for Big Data analysis and computation. While data Volume, Variety and Velocity increases, Hadoop as a batch processing framework cannot cope with the requirement for real time analytics. Spark, Storm and the Lambda Architecture can help bridge the gap between batch and event based processing.

  • Oracle Big Data Appliance and Connectors Support Integration with Hadoop and Cloudera Manager

    Oracle Big Data Appliance and Big Data Connectors support integration with Hadoop, Cloudera Manager and Oracle NoSQL Database. Oracle announced last month the availability of Big Data Appliance and Connectors as well as partnership with Cloudera. They also recently announced the Advanced Analytics for Big Data by integrating R statistical programming language into Oracle Database 11g.

  • Hadoop-as-a-Service from Amazon, Cloudera, Microsoft and IBM

    Companies rely more and more on big data when making their decisions. Amazon, Cloudera, and IBM have announced their Hadoop-as-a-Service offerings, while Microsoft promises to do the same next year.

BT