Mark Harwood shows how anomaly detection algorithms can spot card fraud, incorrectly tagged movies and the UK's most unexpected hotspot for weapon possession.
Baruch Sadogursky gives an overview of the search and testing tools available to Grails developers and compares them.
Michael Brunton-Spall shares his experience re-architecting The Guardian's Content API from a system based on Solr to a message queue cloud service based on ElasticSearch, without any downtime.
Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing terabytes of log data with Think Big Analytics.
Phil Wills discusses why The Guardian has introduced the Content Web API, how it has influenced the architecture of the site and how they develop software and collaborate with partners.
Shay Banon demos ElasticSearch, an open source, distributed, RESTful search engine, detailing some of its features: distribution, cloud readiness, facets, and the percolator.
John Wang discusses the architecture and implementation details of LinkedIn's real-time distributed search engine, covering People Search, Signal, Stream Indexing, Zoie, and Bobo.
This presentation discusses Hypertable, an open source, high-performance distributed database modeled after Google's Bigtable. Doug discusses the differences between Hypertable and traditional database technology, its support for massive sparse tables, scaling to petabyte size, and how Hypertable is designed to run on top of an existing distributed file system, such as the Hadoop DFS.