Big Data Content on InfoQ

Presentations

AI, ML & Data Engineering

Spreadsheets for Developers

Felienne Hermans presents various algorithms that demonstrate the power of Excel, showing that spreadsheets are fit for TDD and rapid prototyping.

Felienne Hermans
on Sep 11, 2015

01:21:47
The Many Faces of Apache Kafka: How is Kafka Used in Practice

Neha Narkhede discusses how companies are using Apache Kafka and where it fits in the Big Data ecosystem.

Neha Narkhede
on Aug 27, 2015

42:09
Financial Modeling with Apache Spark: Calculating Value at Risk

Sandy Ryza aims to give a feel for what it is like to approach financial modeling with modern big data tools, using the Monte Carlo method for a basic VaR calculation with Spark.

Sandy Ryza
on Jul 12, 2015

42:33
Lightning Fast Cluster Computing with Spark and Cassandra

Piotr Kołaczkowski discusses how Spark was integrated with Cassandra, how the integration works in practice, and why it is better than using a Hadoop intermediate layer.

Piotr Kołaczkowski
on Jun 17, 2015

49:53
Translating Imperative Code to MapReduce

The authors present an approach for automatic translation of sequential, imperative code into a parallel MapReduce framework using Mold, translating Java code to run on Apache Spark.

Cosmin Radoi, Manu Sridharan, Stephen J Fink, Rodric Rabbah
on Jun 10, 2015

19:02
Understanding Cloud, Big Data, Mobile and Security – Do They Play Nicely Together?

Colin Mower discusses the challenges of using Cloud, Big Data, Mobile and Security together, and how these can be combined to achieve business value.

Colin Mower
on May 12, 2015

41:57
A Taste of Random Decision Forests on Apache Spark

Sean Owen introduces Spark, Scala and random decision forests, and demonstrates the process of analyzing a real-world data set with them.

Sean Owen
on Apr 28, 2015

48:14
Big Data in Memory

John Davies shows a Spring workflow consuming 7.4kB XML messages, binding them to 25kB of Java but storing them in just 450 bytes each, with 10 million derivative contracts held in memory on a laptop.

John Davies
on Mar 14, 2015

01:06:43
Gobblin: A Framework for Solving Big Data Ingestion Problem

Lin Qiao discusses the architecture of Gobblin, LinkedIn’s framework for addressing the need for high-quality, high-velocity data ingestion.

Lin Qiao
on Mar 12, 2015

44:13
Better Together - Using Spark and Redshift to Combine Your Data with Public Datasets

Eugene Mandel discusses challenges of conforming data sources and compares processing stacks: Hadoop+Redshift vs Spark, showing how the technology drives the way the problem is modeled.

Eugene Mandel
on Mar 12, 2015

35:16
High Performance Computing Contributions to the World of Big Data

Sharan Kalwani presents the history of HPC and the technologies and trends which have contributed to creating the world of big data, covering applications of HPC resulting in big data technologies.

Sharan Kalwani
on Jan 11, 2015

52:07
A Distributed Transactional Database on Hadoop

John Leach explains using HBase co-processors to support a full ANSI SQL RDBMS without modifying the core HBase source, showing how Hadoop/HBase can replace traditional RDBMS solutions.

John Leach
on Jan 02, 2015

55:55
