Database Content on InfoQ

Presentations

Architecture & Design

Serverless Design Patterns with AWS Lambda: Big Data with Little Effort

Tim Wagner discusses Big Data on serverless, showing working examples and how to set up a CI/CD pipeline, demonstrating AWS Lambda with the Serverless Application Model (SAM).

Tim Wagner
on Jul 29, 2017

50:45
Development

Power of the Log: LSM & Append Only Data Structures

Ben Stopford talks about the beauty of sequential access and append only data structures in the context of “Log Structured Merge Trees”.

Ben Stopford
on Jun 15, 2017

31:56
Architecture & Design

Applied Distributed Research in Apache Cassandra

Jonathan Ellis explains the challenges and successes Cassandra has had in creating transactions, materialized views, and a strongly consistent cluster membership within this peer-to-peer paradigm.

Jonathan Ellis
on Jun 10, 2017

01:09:27
AI, ML & Data Engineering

Scio: Moving Big Data to Google Cloud, a Spotify Story

Neville Li tells the story of how Spotify migrated its big data infrastructure to Google Cloud, replacing Hive and Scalding with BigQuery and Scio, which helped its teams iterate faster.

Neville Li
on May 26, 2017

54:50
Architecture & Design

In-Memory Caching: Curb Tail Latency with Pelikan

Yao Yue introduces Pelikan, a framework for implementing distributed caches such as Memcached and Redis. She discusses the system aspects that are important to the performance of such services.

Yao Yue
on May 02, 2017

47:56
AI, ML & Data Engineering

Data Preparation for Data Science: A Field Guide

Casey Stella presents a utility written with Apache Spark to automate data preparation by discovering missing values, values with skewed distributions, and likely errors within data.

Casey Stella
on Apr 23, 2017

45:00
Architecture & Design

Building Reliability in an Unreliable World

Greg Murphy describes how GameSparks has designed their platform to be tolerant of many things: unreliable and slow internet connectivity, cloud resources that can fail without warning, and more.

Greg Murphy
on Apr 20, 2017

50:39
AI, ML & Data Engineering

AI from an Investment Perspective

The panelists discuss AI from an investment perspective, the challenges, the risks, trends, the role of Deep Learning, successful AI use cases, and more.

Pankaj Mitra, Doug Dooley, Sanjit Dang, Kiersten Stead, Yashwanth Hemaraj, Leonard Speiser, Kartik Gada
on Apr 18, 2017

42:48
Architecture & Design

Causal Consistency for Large Neo4j Clusters

Jim Webber explores the new Causal Clustering architecture for Neo4j and how it allows users to read their own writes straightforwardly, explaining why this is difficult to achieve in distributed systems.

Jim Webber
on Apr 07, 2017

49:40
AI, ML & Data Engineering

Big Data Infrastructure @ LinkedIn

Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.

Shirshanka Das
on Apr 02, 2017

50:48
Development

Performance and Search

Dan Luu discusses how to estimate performance using back of the envelope calculations that can be done in minutes or hours, even for applications that take months or years to implement.

Dan Luu
on Apr 01, 2017

41:09
Architecture & Design

Scaling up Near Real-Time Analytics @ Uber & LinkedIn

Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.

Yi Pan, Chinmay Soman
on Mar 30, 2017

46:03
