InfoQ Homepage Infrastructure Content on InfoQ

Presentations

RSS Feed

Newer Older

DevOps

Scaling Instagram Infrastructure

Lisa Guo overviews Instagram's infrastructure, its history, multi-data center support, tuning uwsgi parameters for scaling, performance monitoring and diagnosis, and Django/Python upgrade.

Lisa Guo
on May 19, 2017

Icon

51:11
Architecture & Design

In-Memory Caching: Curb Tail Latency with Pelikan

Yao Yue introduces Pelikan - a framework to implement distributed caches such as Memcached and Redis. She discusses the system aspects that are important to the performance of such services.

Yao Yue
on May 02, 2017

Icon

47:56
AI, ML & Data Engineering

Data Preparation for Data Science: A Field Guide

Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.

Casey Stella
on Apr 23, 2017

Icon

45:00
DevOps

Challenging Perceptions of NHS IT

Edward Hiley, Dan Rathbone talk about how NHS Digital has built a highly secure and resilient system for processing patient data, applying techniques more often used in the cloud to bare metal servers

Dan Rathbone Edward Hiley
on Apr 20, 2017

Icon

49:01
AI, ML & Data Engineering

AI from an Investment Perspective

The panelists discuss AI from an investment perspective, the challenges, the risks, trends, the role of Deep Learning, successful AI use cases, and more.

Pankaj Mitra Doug Dooley Sanjit Dang Kiersten Stead Yashwanth Hemaraj Leonard Speiser Kartik Gada
on Apr 18, 2017

Icon

42:48
DevOps

Testing Programmable Infrastructure with Ruby

Matt Long talks about some approaches to environment infrastructure testing that his team at OpenCredo has created using Ruby.

Matt Long
on Apr 12, 2017

Icon

49:48
Architecture & Design

Causal Consistency for Large Neo4j Clusters

Jim Webber explores the new Causal clustering architecture for Neo4j, how it allows users to read writes straightforwardly, explaining why this is difficult to achieve in distributed systems.

Jim Webber
on Apr 07, 2017

Icon

49:40
AI, ML & Data Engineering

Big Data Infrastructure @ LinkedIn

Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.

Shirshanka Das
on Apr 02, 2017

Icon

50:48
AI, ML & Data Engineering

Real-Time Recommendations Using Spark Streaming

Elliot Chow discusses the data pipeline that they built with Kafka, Spark Streaming, and Cassandra to process Netflix user activities in real time for the Trending Now row.

Elliot Chow
on Mar 30, 2017

Icon

47:03
AI, ML & Data Engineering

Building a Data Science Capability from Scratch

Victor Hu covers the challenges, both technical and cultural, of building a data science team and capability in a large, global company.

Victor Hu
on Mar 23, 2017

Icon

49:06
AI, ML & Data Engineering

Data Science in the Cloud @StitchFix

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

Stefan Krawczyk
on Feb 17, 2017

Icon

40:48
DevOps

Petabytes Scale Analytics Infrastructure @Netflix

Tom Gianos and Dan Weeks discuss Netflix' overall big data platform architecture, focusing on Storage and Orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.

Dan Weeks Tom Gianos
on Feb 15, 2017

Icon

45:26

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations