Hive Content on InfoQ

Presentations

AI, ML & Data Engineering

Data Science in the Cloud @StitchFix

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

Stefan Krawczyk
on Feb 17, 2017

40:48
AI, ML & Data Engineering

Streaming Live Data and the Hadoop Ecosystem

Oleg Zhurakousky discusses the Hadoop ecosystem (Hadoop, HDFS, YARN) and how projects such as Hive, Atlas, and NiFi interact and integrate to support the variety of data used for analytics.

Oleg Zhurakousky
on Jan 29, 2017

33:53
AI, ML & Data Engineering

Achieving Mega-Scale Business Intelligence through Speed of Thought Analytics on Hadoop

Ian Fyfe discusses the different options for implementing speed-of-thought business analytics and machine learning tools directly on top of Hadoop.

Ian Fyfe
on Oct 26, 2016

30:29
The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE

Randy Shoup describes KIXEYE's analytics infrastructure from Kafka queues through Hadoop 2 to Hive and Redshift, built for flexibility, experimentation, iteration, testability, and reliability.

Randy Shoup
on Jul 19, 2014

51:04
REEF: Retainable Evaluator Execution Framework

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

Rusty Sears
on Dec 10, 2013

38:11
Apache Tez: Accelerating Hadoop Query Processing

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

Arun Murthy, Bikas Saha
on Dec 05, 2013

38:16
Big Data Platform as a Service at Netflix

Jeff Magnusson details some of Netflix's key services: Franklin, Sting, and Lipstick.

Jeff Magnusson
on Nov 18, 2013

52:24
Petabyte Scale Data at Facebook

Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.

Dhruba Borthakur
on Dec 17, 2012

49:37
Hadoop and Cassandra, Sitting in a Tree ...

Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.

Jake Luciani
on May 30, 2012

45:41
