BT

Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ

Topics

Choose your language

InfoQ Homepage Big Data Content on InfoQ

  • Splunk's Hunk 6.1 Brings New Capabilities for Big Data Analytics

    Splunk, a company specializing in searching, monitoring, and analyzing machine-generated data, has announced the release of Hunk 6.1. Hunk provides an analytics platform for big data. The new release also provides streaming resource libraries to connect Hunk to any NoSQL data store, such as Apache Cassandra, MongoDB, and Neo4j.

  • Community the Focus at ApacheCON NA 2014

    This year's ApacheCON North America conference saw key speakers focus on open source and its community. With more than 400 attendees, over 70 projects represented and 180 conference sessions it covered as many diverse topics as diverse the Apache Software Foundation projects are.

  • Cascading 3.0 Adds Multiple Framework Support. Concurrent Driven Manages Big Data Apps

    Concurrent will release Cascading 3.0 in early summer to allow certain applications to run on multiple Big Data frameworks including MapReduce, Tez, Spark, Storm and others. Additionally, Driven, the new commercial product from Concurrent, provides powerful enterprise data application management for Big Data applications.

  • Hortonworks Announces Hive 0.13 with Vectorized Query Execution and Hive on Tez

    Hortonworks announced the release of Hive 0.13 which marks the completion of the Stinger initiative. The new release also includes performance improvements as well as some new SQL features. Hive is an open source SQL Engine written on top of Hadoop that lets users query big data warehouses by writing SQL queries instead of MapReduce jobs.

  • Cloudera Partners with MongoDB to Store Hadoop Data on Their NoSQL DB

    Starting from the premise that today “80 percent of enterprise data is unstructured and growing at twice the rate of structured data”, Cloudera and MongoDB have announced a “strategic” partnership meant to provide customers the option to combine Cloudera’s Apache-based Big Data platform with MongoDB’s NoSQL solution.

  • Continuous Development,is it our new maintenance reality?

    The Internet of Things, Web APIs and Big Data will make continuous development a necessary reality and will tie developers down with maintenance work on completed applications, says Andrew Binstock of Dr. Dobbs. In that case, short sprints, continuous integration and deployment and modern programming practices are even more important to ensure a developer's time is better utilized.

  • DataBricks Announces Spark SQL for Manipulating Structured Data Using Spark

    DataBricks, the company behind Apache Spark, has announced a new addition into the Spark ecosystem called Spark SQL. Spark SQL is separate from Shark, and does not use Hive under the hood. InfoQ reached out to Reynold Xin and Michael Armbrust, software engineers at DataBricks, to learn more about Spark SQL.

  • A Roundup of Cloudera Distribution Containing Apache Hadoop 5

    Cloudera recently released the latest version of its software distribution, CDH5. Almost 20 months after the last major version, CDH4 seems like ages in the Big Data world. We take a look at new features this release brings and the future direction of Cloudera after the latest round of investment from Intel and Google Ventures.

  • Hydra Takes On Hadoop

    The social-networking company AddThis open-sourced Hydra under the Apache version 2.0 License in a recent announcement. Hydra grew from an in-house platform created to process semi-structured social data as live streams and do efficient query processing on those data sets.

  • Spark Gets a Dedicated Big Data Platform

    Spark users can now use a new Big Data platform provided by intelligence company Atigeo, which bundles most of the UC Berkeley stack into a unified framework optimized for low-latency data processing that can provide significant improvements over more traditional Hadoop-based platforms.

  • Rebecca Parsons on the ThoughtWorks Technology Radar

    In January ThoughtWorks released the latest version of their Technology Radar in which they track what's interesting in the software development ecosystem. The big themes this year are (1) early warning systems and recovery in production, (2) the tension between privacy and big data, (3) the javascript ecosystem and (4) blurring of the line between the physical and virtual worlds.

  • HBase 0.98 Introduces Cell-based Security

    Apache released HBase 0.98 primarily addressing convergence with Apache Accumulo via cell-based security while resolving over 230 JIRA issues. These new security features are modeled after Accumulo.

  • Graph Processing Using Big Data Technologies

    Processing extremely large graphs has been and remains a challenge, but recent advances in Big Data technologies have made this task more practical. Tapad, a startup based in NYC focused on cross-device content delivery, has made graph processing the heart of their business model using Big Data to scale to terabytes of data.

  • Domino: Datascience-as-a-Service

    Domino, a Platform-as-a-Service for data science, enables people to do analytical work using languages such as Python or R in the cloud (EC2).

  • Big Data Hadoop Solutions, State of Affairs in Q1/2014

    According to a new Forrest report, Hadoop’s momentum is unstoppable. Its usage in the enterprise is continuously growing due to its ability to offer companies new ways to store, process, analyze, and share big data. The report takes a look at Hadoop vendors and ranks them.

BT