Older rss

Rethinking Streaming Analytics for Scale

Posted by Helena Edelson  on  Apr 03, 2016

Helena Edelson addresses new architectures emerging for large scale streaming analytics based on Spark, Mesos, Akka, Cassandra and Kafka (SMACK) or Apache Flink or GearPump.


Lightning Fast Cluster Computing with Spark and Cassandra

Posted by Piotr Kołaczkowski  on  Jun 17, 2015

Piotr Kołaczkowski discusses how they integrated Spark with Cassandra, how it was done, how it works in practice and why it is better than using a Hadoop intermediate layer.


How SoundCloud Uses Cassandra

Posted by Emily Green  on  Apr 19, 2015 1

Emily Green is taking a look at how SoundCloud uses Cassandra. She describes a couple of Cassandra instances, from the point of view of the products and functionality they support.


Efficient Data Storage for Analytics with Parquet 2.0

Posted by Julien Le Dem  on  Mar 22, 2015

Julien Le Dem discusses the advantages of a columnar data layout, specifically the features and design choices Apache Parquet uses to achieve goals of interoperability, space and query efficiency.


NoSQL Is Dead

Posted by Eric Redmond  on  Feb 06, 2015 1

Eric Redmond explains the differences and commonalities amongst many kinds of databases and takes a stab at the marketing term “NoSQL.”


A Distributed Transactional Database on Hadoop

Posted by John Leach  on  Jan 02, 2015

John Leach explains using HBase co-processors to support a full ANSI SQL RDBMS without modifying the core HBase source, showing how Hadoop/HBase can replace traditional RDBMS solutions.


Cassandra, Couchbase and Spring Data in the Enterprise

Posted by Matthew Adams,Michael Nitschinger  on  Jan 01, 2015

The authors focus on POJO persistence over Cassandra, including automatic Cassandra schema generation and Spring context configuration using both XML and Java.


Zen: Pinterest's Graph Storage Service

Posted by Xun Liu,Raghavendra Prabhu  on  Dec 25, 2014

This talk goes over the design motivation for Zen and describe its internals including the API, type system and HBase backend.


Unleash the Power of HBase Shell

Posted by Jayesh Thakrar  on  Dec 07, 2014

Jayesh Thakrar shows what can be done with irb, how to exploit JRuby-Java integration, and demonstrates how the Shell can be used in Hadoop streaming to perform complex and large volume batch jobs.


Going Native with Apache Cassandra

Posted by Johnny Miller  on  Jun 18, 2014

In this solutions track talk, sponsored by DataStax, Johnny Miller introduces the Cassandra native protocol, native drivers and CQL, explaining how to query Cassandra without Trift or RPC.


Scaling Pinterest

Posted by Yash Nelapati, Marty Weiner  on  Dec 30, 2013

Details on Pinterest's architeture, its systems -Pinball, Frontdoor-, and stack - MongoDB, Cassandra, Memcache, Redis, Flume, Kafka, EMR, Qubole, Redshift, Python, Java, Go, Nutcracker, Puppet, etc.


Graph Computing at Scale

Posted by Matthias Broecheler  on  Dec 27, 2013

Matthias Broecheler discusses graph computing, introducing the Aurelius graph cluster enabling graph computing at scale by building on distributed systems like Cassandra, HBase, and Hadoop.

General Feedback
Marketing and all content copyright © 2006-2016 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.
Privacy policy

We notice you're using an ad blocker

We understand why you use ad blockers. However to keep InfoQ free we need your support. InfoQ will not provide your data to third parties without individual opt-in consent. We only work with advertisers relevant to our readers. Please consider whitelisting us.