InfoQ

InfoQ

Presentation

My Bookmarks

Login or Register to enable bookmarks for unlimited time.

The content has been bookmarked!

There was an error bookmarking this content! Please retry.

Recorded at:
Recorded at

Adopting Apache Cassandra

Presented by Eben Hewitt on Dec 01, 2010 Length 00:53:24     Download: MP3
     Slides
Sections
Operations & Infrastructure,
Architecture & Design,
Development
Topics
Cassandra ,
Big Data ,
BigTable ,
Strange Loop 2010 ,
Data Access ,
Database Design ,
Strange Loop ,
API ,
Cloud Computing ,
NoSQL ,
Architecture ,
Database ,
Programming ,
Conferences
 

How would you like to view the presentation?

In case you are having issues watching this video, please follow these simple steps to help us investigate the issue:
1. Right click on the video player and select Copy log
2. Paste the copied information in an email to video-issue@infoq.com (clicking this link will fill in the default details in most email clients).
Note: in case your email client hasn't automatically picked up the email subject, please include in your email the URL of the video too.
3. Done.
We will investigate the issue and get back to you as soon as possible. Thanks for helping us improve our site!
Summary
Eben Hewitt introduces the Apache Cassandra project to those interested in getting a quick clear picture of what Cassandra is, what are its main features, what is the the data model used and the API.

Bio
Eben Hewitt is Director of Application Architecture at Choice Hotels International. Hewitt has spoken to thousands of developers around the world on SOA, REST, and general architecture topics, has several certifications in Java and TOGAF. He is the author of “Cassandra: The Definitive Guide” and “Java SOA Cookbook” and a contributor to 97 Things Every Software Architect Should Know.

About the conference
Strange Loop is a developer-run software conference. Innovation, creativity, and the future happen in the magical nexus "between" established areas. Strange Loop eagerly promotes a mix of languages and technologies in this nexus, bringing together the worlds of bleeding edge technology, enterprise systems, and academic research. Of particular interest are new directions in data storage, alternative languages, concurrent and distributed systems, front-end web, semantic web, and mobile apps.

Related Sponsor

Neo4j is a robust, high-performance, scalable graph database. It is the only NOSQL database that solves the complex, connected data challenges that enterprises face today.

  • This article is part of a featured topic series on NoSQL
Good overview but some contrast/comparison with MongoDB would be useful by Faisal Waris Posted
Re: Good overview but some contrast/comparison with MongoDB would be useful by Alex Miller Posted
  1. Back to top

    Good overview but some contrast/comparison with MongoDB would be useful

    by Faisal Waris

    I just started dabbling with MongoDB for some data mining applications.

    My sense is that MongoDB is the most widely used no-sql db today and so some comparison to it would be useful.

    On the face of it, Mongo offers some features that are missing in Cassandra today:

    - adhoc query capability using a javascript type language
    - ability to run server side script functions
    - auto sharding (mongo shards and balances the distribution, automatically)
    - map-reduce with server side scripts

    It seems that Cassandra is better at handling a large volume of writes but requires programming to perform queries and data processing.

    Mongo has some notion of 'client tunable' consistency but I am still grapling with that.

    For data mining (where you load data less fequently and do many queries) would Mongo be a better choice? If large volume of writes are required then Cassandra should be favored?

  2. Back to top

    Re: Good overview but some contrast/comparison with MongoDB would be useful

    by Alex Miller

    FYI, there were a couple more nosql-related talks recorded at Strange Loop that will available here on InfoQ in the coming months:


    • Week of Dec 20th - "NoSQL at Twitter" - Kevin Weil (Twitter)

    • Week of Jan 17th - "Enterprise NoSQL: Silver Bullet or Poison Pill?" - Billy Newport (IBM)

    • Week of Feb 14th - a panel discussion about non-relational stores

    • Week of Feb 28th - "Scaling MongoDB" by Roger Bodamer of 10gen

Educational Content

Eventually Consistent HTTP with Statebox and Riak

Bob Ippolito explains how to solve concurrent update conflicts with Statebox, an open source library for automatic conflict resolution, running on top of Riak.

Java.next

Erik Onnen attempts to demonstrate that Java is still the best programming language for the JVM if simplified idioms are used along with proper tooling.

Evolution in Data Integration From EII to Big Data

Approaches to integrating data are changing with emergence of cloud computing.

Winning Hearts and Minds: How to Embed UX from Scratch in a Large Organization

Michele Ide-Smith presents the lessons learned in the process of introducing UX principles and techniques into a large organization through a series of small steps.

LMAX Disruptor: 100K TPS at Less than 1ms Latency

Dave Farley and Martin Thompson discuss solutions for doing low-latency high throughput transactions based on the Disruptor concurrency pattern.

Thoughts on Test Automation in Agile

Rajneesh Namta shares his thoughts, experiences, and some of the critical lessons learned while implementing software test automation on a recent Agile project.

Actor Interaction Patterns

Dale Schumacher presents several patterns of actor interaction that can be used in collaborative programs written in any language.

Scalaz: Functional Programming in Scala

Rúnar Bjarnason discusses Scalaz, a Scala library of pure data structures, type classes, highly generalized functions, and concurrency abstractions to perform functional programming in Scala.