01:26:08

Groovy Vampires: Combining Groovy, REST, NoSQL, and More

Posted by Ken Kousen  on  May 09, 2015

Ken Kousen discusses combining various technologies: Groovy, Ratpack, MongoDB, Grails, REST.

48:14

A Taste of Random Decision Forests on Apache Spark

Posted by Sean Owen  on  Apr 28, 2015

Sean Owen introduces Spark, Scala and random decision forests, and demonstrates the process of analyzing a real-world data set with them.

53:13

Analyzing Social Networks with F#

Posted by Evelina Gabasova  on  Apr 22, 2015

Evelina Gabasova explains how to run a social network analysis on Twitter and how to use data science tools to find out more about followers.

48:23

Don’t Let Data Gravity Crush Your Infrastructure

Posted by Dave McCrory  on  Apr 19, 2015

Dave McCrory explains what Data Gravity is, how it affects performance and portability, and why these effects are amplified as data volumes grow.

42:08

How SoundCloud Uses Cassandra

Posted by Emily Green  on  Apr 19, 2015

Emily Green takes a look at how SoundCloud uses Cassandra. She describes a couple of Cassandra instances from the point of view of the products and functionality they support.

57:10

Customer Insight, from Data to Information

Posted by Thore Thomassen  on  Mar 27, 2015

Thore Thomassen shares, from experience, how to combine structured data in a DWH with unstructured data in NoSQL, and how to use parallel data warehouse appliances to boost analytical capabilities.

38:03

Efficient Data Storage for Analytics with Parquet 2.0

Posted by Julien Le Dem  on  Mar 22, 2015

Julien Le Dem discusses the advantages of a columnar data layout, specifically the features and design choices Apache Parquet uses to achieve its goals of interoperability, space efficiency, and query efficiency.

01:28:53

GORM Inside and Out

Posted by Jeff Scott Brown  on  Mar 21, 2015

Jeff Scott Brown introduces GORM, a super powerful ORM tool that makes ORM simple by leveraging the flexibility and expressiveness of a dynamic language like Groovy.

33:44

Programming and Testing a Distributed Database

Posted by Reid Draper  on  Mar 20, 2015

Reid Draper shows how real-world distributed databases work, communicate, and are tested, trading RPC for messaging, unit tests for QuickCheck, and micro-benchmarks for multi-week stress tests.

01:23:00

Using a Graph Database for JVM Heap Analysis

Posted by James Richardson, Nat Pryce  on  Mar 19, 2015

James Richardson and Nat Pryce discuss some of the challenges they faced using Neo4j for interactive analysis of large data imports (80k nodes, 150k relationships) and how they overcame them.

01:06:43

Big Data in Memory

Posted by John Davies  on  Mar 14, 2015

John Davies shows a Spring workflow consuming 7.4kB XML messages, binding them to 25kB Java but storing them in just 450 bytes each, which keeps 10 million derivative contracts in memory on a laptop.

44:13

Gobblin: A Framework for Solving Big Data Ingestion Problem

Posted by Lin Qiao  on  Mar 12, 2015

Lin Qiao discusses the architecture of Gobblin, LinkedIn’s framework for addressing the need for high-quality, high-velocity data ingestion.

InfoQ.com and all content copyright © 2006-2015 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.