InfoQ Homepage Database Content on InfoQ

Presentations

RSS Feed

Newer Older

Architecture & Design

Architecture & Algorithms Powering Search @ZocDoc

Brian D'Alessandro and Pedro Rubio talk about the patient friendly search system they have built at Zocdoc using various products from the AWS stack and custom Machine Learning pipelines.

Brian D'Alessandro Pedro Rubio
on Sep 04, 2017

Icon

49:47
AI, ML & Data Engineering

Orchestrating Chaos: Applying Database Research in the Wild

Peter Alvaro describes LDFI’s (Lineage-driven Fault Injection) theoretical roots in database research, presenting early results from the field and opportunities for near and long-term future research.

Peter Alvaro
on Aug 10, 2017

Icon

41:42
DevOps

Managing Thousands of Data Services @Heroku

Gabriel Enslein discusses the evolution of fleet orchestration, immutable infrastructure, security auditing for managing data services for many Salesforce customers.

Gabriel Enslein
on Aug 09, 2017

Icon

33:00
AI, ML & Data Engineering

Scaling with Apache Spark

Holden Karau looks at Apache Spark from a performance/scaling point of view and what’s needed to handle large datasets.

Holden Karau
on Aug 05, 2017

Icon

46:58
Architecture & Design

Managing Data in Microservices

Randy Shoup shares microservices managing data patterns from Google, eBay, and Stitch Fix., talking on the need to access the data only through microservice's interface, communicate through events.

Randy Shoup
on Aug 01, 2017

Icon

52:06
Architecture & Design

Serverless Design Patterns with AWS Lambda: Big Data with Little Effort

Tim Wagner discusses Big Data on serverless, showing working examples and how to set up a CI/CD pipeline, demonstrating AWS Lambda with the Serverless Application Model (SAM).

Tim Wagner
on Jul 29, 2017

Icon

50:45
Development

Power of the Log:LSM & Append Only Data Structures

Ben Stopford talks about the beauty of sequential access and append only data structures in the context of “Log Structured Merge Trees”.

Ben Stopford
on Jun 15, 2017

Icon

31:56
Architecture & Design

Applied Distributed Research in Apache Cassandra

Jonathan Ellis explains the challenges and successes Cassandra has had in creating transactions, materialized views, and a strongly consistent cluster membership within this peer-to-peer paradigm.

Jonathan Ellis
on Jun 10, 2017

Icon

01:09:27
AI, ML & Data Engineering

Scio: Moving Big Data to Google Cloud, a Spotify Story

Neville Li tells the Spotify’s story of migrating their big data infrastructure to Google Cloud, replacing Hive and Scalding with BigQuery and Scio, which helped them iterate faster.

Neville Li
on May 26, 2017

Icon

54:50
Architecture & Design

In-Memory Caching: Curb Tail Latency with Pelikan

Yao Yue introduces Pelikan - a framework to implement distributed caches such as Memcached and Redis. She discusses the system aspects that are important to the performance of such services.

Yao Yue
on May 02, 2017

Icon

47:56
AI, ML & Data Engineering

Data Preparation for Data Science: A Field Guide

Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.

Casey Stella
on Apr 23, 2017

Icon

45:00
Architecture & Design

Building Reliability in an Unreliable World

Greg Murphy describes how GameSparks has designed their platform to be tolerant of many things: unreliable and slow internet connectivity, cloud resources that can fail without warning, and more.

Greg Murphy
on Apr 20, 2017

Icon

50:39

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations