InfoQ Homepage Big Data Infrastructure Content on InfoQ

Presentations

RSS Feed

AI, ML & Data Engineering

Big Data Infrastructure @ LinkedIn

Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.

Shirshanka Das
on Apr 02, 2017

Icon

50:48
DevOps

Petabytes Scale Analytics Infrastructure @Netflix

Tom Gianos and Dan Weeks discuss Netflix' overall big data platform architecture, focusing on Storage and Orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.

Dan Weeks Tom Gianos
on Feb 15, 2017

Icon

45:26
The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE

Randy Shoup describes KIXEYE's analytics infrastructure from Kafka queues through Hadoop 2 to Hive and Redshift, built for flexibility, experimentation, iteration, testability, and reliability.

Randy Shoup
on Jul 19, 2014

Icon

51:04
Data & Infrastructure at Airbnb

Brenden Matthews describes the infrastructure built at Airbnb using Mesos in order to support Hadoop and Storm.

Brenden Matthews
on Dec 31, 2013

Icon

46:49
Making the Internet a Better Place: Scaling AppNexus

Mike Nolet shares lessons learned scaling AppNexus and architectural details of their system processing 30TB/day: Hadoop, DNS built in GSLB and Keepalived, and real-time data streaming built in C.

Mike Nolet
on Oct 18, 2013

Icon

45:11
Lean Data Architecture: Minimize Investment, Maximize Value

Manvir Singh Grewal and Brandon Byars propose a business intelligence workflow along with Lean principles and practices for implementing a data warehouse and reporting capability.

Brandon Byars Manvir Singh Grewal
on Jan 04, 2013

Icon

53:59
Big Data Problems in Monitoring at eBay

Bhaven Avalani and Yuri Finklestein discuss 4 aspects encountered at eBay when dealing with monitoring data: reduction of data entropy, robust data distribution, metric extraction, efficient storage.

Bhaven Avalani Yuri Finklestein
on Dec 21, 2012

Icon

50:16
Big Data, Small Computers

Cliff Click discusses RAIN, H2O, JMM, Parallel Computation, Fork/Joins in the context of performing big data analysis on tons of commodity hardware.

Cliff Click
on Dec 20, 2012

Icon

22:53
Facebook News Feed: Social Data at Scale

Serkan Piantino discusses news feeds at Facebook: the basics, infrastructure used, how feed data is stored, and Centrifuge – a storage solution.

Serkan Piantino
on Nov 26, 2012

Icon

40:57
Saving the World (from|with) Big Data

Bruce Durling discusses the impact of cloud computing on the climate and what can be done to reduce the amount of CO2 generated by data centers in order to process big data.

Bruce Durling
on Sep 20, 2012

Icon

25:20
"Big Data" and the Future of DevOps

Ram C Singh discusses using Big Data for infrastructure telemetry along with good practices and an autonomic engine to create an autonomic computing infrastructure that might prevent downtime.

Ram Singh
on Aug 24, 2012

Icon

25:05

Unlock the full InfoQ experience

Don't have an InfoQ account?

Topics

Expanding Swift from Apps to Services

You’ve Generated Your MVP Using AI. What Does That Mean for Your Software Architecture?

Building Embedding Models for Large-Scale Real-World Applications

Beyond Code: How Engineers Need to Evolve in the AI Era

From Alert Fatigue to Agent-Assisted Intelligent Observability

Helpful links

Choose your language

Presentations

Big Data Infrastructure @ LinkedIn

Petabytes Scale Analytics Infrastructure @Netflix

The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE

Data & Infrastructure at Airbnb

Making the Internet a Better Place: Scaling AppNexus

Lean Data Architecture: Minimize Investment, Maximize Value

Big Data Problems in Monitoring at eBay

Big Data, Small Computers

Facebook News Feed: Social Data at Scale

Saving the World (from|with) Big Data

"Big Data" and the Future of DevOps

WhatsApp Deploys Rust-Based Media Parser to Block Malware on 3 Billion Devices

GitHub Copilot SDK Lets Developers Integrate Copilot CLI's Engine into Apps

How CNAME Ordering in RFC Specs Caused Cloudflare 1.1.1.1 Outage

OpenAI Scales Single Primary PostgreSQL Instance to Millions of Queries per Second for ChatGPT

You’ve Generated Your MVP Using AI. What Does That Mean for Your Software Architecture?

[Video Podcast] The Craft of Software Architecture in the Age of AI Tools

Beyond Code: How Engineers Need to Evolve in the AI Era

Creating Impactful Teams through Diversity Using Session 0

Scaling to 100+ as a Director: Lessons From Growing Engineering Organizations

Sixteen Claude Agents Built a C Compiler Without Human Intervention... Almost

Building Embedding Models for Large-Scale Real-World Applications

VillageSQL Launches as an Extension-Focused MySQL Fork

From Paging to Postmortem: Google Cloud SREs on Using Gemini CLI for Outage Response

Teleport Launches Agentic Identity Framework to Secure AI Agents Across Enterprise Infrastructure

Kubernetes Drives AI Expansion as Cultural Shift Becomes Critical

QCon London

QCon AI Boston

QCon San Francisco

InfoQ Software Architects' Newsletter

Presentations