58:37

Building Distributed Systems with Apache Mesos

Posted by Benjamin Hindman  on  Jul 25, 2015

Benjamin Hindman discusses Apache Mesos, focusing on the Mesos API and how the primitives provided by Mesos can make it easier to build new stateful services and frameworks.

38:03

Efficient Data Storage for Analytics with Parquet 2.0

Posted by Julien Le Dem  on  Mar 22, 2015

Julien Le Dem discusses the advantages of a columnar data layout, specifically the features and design choices Apache Parquet uses to achieve goals of interoperability, space and query efficiency.

19:26

Why Would You Integrate Solr and Hadoop?

Posted by Yann Yu  on  Dec 28, 2014

Yann Yu discusses how Solr and Hadoop complement each other, and how to use Solr as a real-time, analytical, full-text search front-end to data stored in Hadoop.

30:34

Apache Spark Plus Many Other Frameworks: How Spark Fits into the Big Data Landscape

Posted by Paco Nathan  on  Nov 09, 2014

Paco Nathan keynotes on how Spark fits into the big data landscape, describing what other systems work with Spark, and explaining why Spark is needed in the future.

01:33:09

What's New in Spring Data?

Posted by Thomas Darimont, Oliver Gierke, Christoph Strobl  on  Nov 07, 2014

This talk provides a broad overview of the new features introduced in the latest Spring Data release trains: recent additions in Spring Data Commons and the latest features of individual store modules.

50:59

ZooKeeper for the Skeptical Architect

Posted by Camille Fournier  on  Aug 17, 2014

Camille Fournier explains which projects ZooKeeper is useful for, the common challenges of running it as a service, and advice to consider when architecting a system that uses it.

47:18

Going Native with Apache Cassandra

Posted by Johnny Miller  on  Jun 18, 2014

In this solutions track talk, sponsored by DataStax, Johnny Miller introduces the Cassandra native protocol, native drivers and CQL, explaining how to query Cassandra without Thrift or RPC.

38:16

Apache Tez: Accelerating Hadoop Query Processing

Posted by Bikas Saha, Arun Murthy  on  Dec 05, 2013

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

44:23

Samza: Real-time Stream Processing at LinkedIn

Posted by Chris Riccomini  on  Nov 28, 2013

Chris Riccomini discusses Samza's feature set, how Samza integrates with YARN and Kafka, how it's used at LinkedIn, and what's next on the roadmap.

36:45

Apache Drill - Interactive Query and Analysis at Scale

Posted by Michael Hausenblas  on  Oct 13, 2013

Michael Hausenblas introduces Apache Drill, a distributed system for interactive analysis of large-scale datasets, including its architecture and typical use cases.

57:12

Rebuilding Your Engine at 200 Miles per Hour

Posted by Michael Brunton-Spall  on  Aug 16, 2013

Michael Brunton-Spall shares his experience re-architecting The Guardian's Content API from a system based on Solr to a message queue cloud service based upon Elasticsearch, without any downtime.

NetApp Case Study

Posted by Kumar Palaniapan and Scott Fleming  on  Jun 01, 2012

Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing TBs of log data with Think Big Analytics.

InfoQ.com and all content copyright © 2006-2015 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.