InfoQ Homepage Database Content on InfoQ

Presentations

RSS Feed

Newer Older

Groovy Vampires: Combining Groovy, REST, NoSQL, and More

Ken Kousen discusses combining various technologies: Groovy, Ratpack, MongoDB, Grails, REST.

Ken Kousen
on May 09, 2015

Icon

01:26:08
A Taste of Random Decision Forests on Apache Spark

Sean Owen introduces Spark, Scala and random decision forests, and demonstrates the process of analyzing a real-world data set with them.

Sean Owen
on Apr 28, 2015

Icon

48:14
Analyzing Social Networks with F#

Evelina Gabasova explains how to run a social network analysis on Twitter and how to use data science tools to find out more about followers.

Evelina Gabasova
on Apr 22, 2015

Icon

53:13
Don’t Let Data Gravity Crush Your Infrastructure

Dave McCrory talks about what is Data Gravity, how it affects performance and portability and why these effects are amplified when there are larger volumes of data.

Dave McCrory
on Apr 19, 2015

Icon

48:23
How SoundCloud Uses Cassandra

Emily Green is taking a look at how SoundCloud uses Cassandra. She describes a couple of Cassandra instances, from the point of view of the products and functionality they support.

Emily Green
on Apr 19, 2015

Icon

42:08
Customer Insight, from Data to Information

Thore Thomassen shares from experience how to combine structured data in a DWH with unstructured data in NoSQL, and using parallel data warehouse appliances to boost the analytical capabilities.

Thore Thomassen
on Mar 27, 2015

Icon

57:10
Efficient Data Storage for Analytics with Parquet 2.0

Julien Le Dem discusses the advantages of a columnar data layout, specifically the features and design choices Apache Parquet uses to achieve goals of interoperability, space and query efficiency.

Julien Le Dem
on Mar 22, 2015

Icon

38:03
GORM Inside and Out

Jeff Scott Brown introduces GORM, a super powerful ORM tool that makes ORM simple by leveraging the flexibility and expressiveness of a dynamic language like Groovy.

Jeff Scott Brown
on Mar 21, 2015

Icon

01:28:53
Programming and Testing a Distributed Database

Reid Draper shows how real world distributed database work, communicate and are tested, trading RPC for messaging, unit-tests for QuickCheck, and micro-benchmarks for multi-week stress tests.

Reid Draper
on Mar 20, 2015

Icon

33:44
Using a Graph Database for JVM Heap Analysis

James Richardson, Nat Pryce discuss some of the challenges faced using Neo4J for interactive analysis of large data imports (80K nodes, 150k relationships) and how they overcame them.

Nat Pryce James Richardson
on Mar 19, 2015

Icon

01:23:00
Big Data in Memory

John Davies shows a Spring work-flow consuming 7.4kB XML messages, binding them to 25kB Java but storing them in just 450 bytes each, 10 million derivative contracts in-memory on a laptop.

John Davies
on Mar 14, 2015

Icon

01:06:43
Gobblin: A Framework for Solving Big Data Ingestion Problem

Lin Qiao discusses the architecture of Gobblin, LinkedIn’s framework for addressing the need of high quality and high velocity data ingestion.

Lin Qiao
on Mar 12, 2015

Icon

44:13

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations