BT
rss
53:16

Finding the Needle in a Big Data Haystack

Posted by Eva Andreasson  on  Jul 08, 2014

In this solutions track talk, sponsored by Cloudera, Eva Andreasson discusses how search and Hadoop can help with some of the industry's biggest challenges. She introduces the data hub concept.

01:21:42

Cool and Ripe for Exploitation: Search-Based Software Engineering (SBSE)

Posted by Christopher Simons  on  Jun 27, 2014

Christopher Simons suggests using SBSE to iterated through multiple possible solutions and select the one that performs the best, offering insight into some available tools and techniques.

46:21

How Elasticsearch Powers the Guardian's Newsroom

Posted by Graham Tackley & Shay Banon  on  Apr 28, 2014 1

Graham Tackley dives into the details of ophan - obstacles faced by the newsroom that prompted them to build the system, how it works for alerting and how the tool has made the Guardian's readers - and staffers - lives better. Shay Banon covers the technical underpinnings of ophan with a deep dive into the Elasticsearch features and functionality that power the ophan system.

42:59

Revealing the Uncommonly Common with Elasticsearch

Posted by Mark Harwood  on  Apr 08, 2014

Mark Harwood shows how anomaly detection algorithms can spot card fraud, incorrectly tagged movies and the UK's most unexpected hotspot for weapon possession.

01:09:05

Search for the Holy Grail (and test it once found)

Posted by Baruch Sadogursky  on  Dec 13, 2013

Baruch Sadogursky overviews and compares search and testing tools available to Grails developers.

57:12

Rebuilding Your Engine at 200 Miles per Hour

Posted by Michael Brunton-Spall  on  Aug 16, 2013

Michael Brunton-Spall shares his experience re-architect The Guardian’ Content API from a system based on Solr to a message queue cloud service based upon Elastic Search, without any downtime.

NetApp Case Study

Posted by Kumar Palaniapan and Scott Fleming  on  Jun 01, 2012 1

Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing TBs of log data with Think Big Analytics.

The Guardian's Content Web API

Posted by Phil Wills  on  Feb 29, 2012

Phil Wills discusses why The Guardian has introduced the Content Web API, how it has influenced the architecture of the site and how they develop software and collaborate with partners.

ElasticSearch - A Distributed Search Engine

Posted by Shay Banon  on  Oct 18, 2011 2

Shay Banon demoes ElasticSearch, an open source distributed and RESTful search engine, detailing some of its features: distributed, cloud readiness, facets, and percolator.

LinkedIn Search: Searching the Social Graph in Real Time

Posted by John Wang  on  Apr 26, 2011

John Wang discusses LinkedIn real-time distributed search engine architecture and implementation details for People Search, Signal, Stream Indexing, Zoie, and Bobo.

Hypertable - An Open Source, High Performance, Scalable Database

Posted by Doug Judd  on  Jul 31, 2009 3

This presentation discusses Hypertable, an open source, high performance, distributed database modeled after Google's Bigtable. Doug discusses the differences between Hypertable and traditional database technology, support for massive sparse tables, scaling to petabytes size, and how Hypertable is designed to run on top of an existing distributed file system, such as the Hadoop DFS.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT