InfoQ Homepage Hive Content on InfoQ

Presentations

RSS Feed

AI, ML & Data Engineering

Data Science in the Cloud @StitchFix

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

Stefan Krawczyk
on Feb 17, 2017

Icon

40:48
AI, ML & Data Engineering

Streaming Live Data and the Hadoop Ecosystem

Oleg Zhurakousky discusses the Hadoop ecosystem – Hadoop, HDFS, Yarn-, and how projects such as Hive, Atlas, NiFi interact and integrate to support the variety of data used for analytics.

Oleg Zhurakousky
on Jan 29, 2017

Icon

33:53
AI, ML & Data Engineering

Achieving Mega-Scale Business Intelligence through Speed of Thought Analytics on Hadoop

Ian Fyfe discusses the different options for implementing speed-of-thought business analytics and machine learning tools directly on top of Hadoop.

Ian Fyfe
on Oct 26, 2016

Icon

30:29
The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE

Randy Shoup describes KIXEYE's analytics infrastructure from Kafka queues through Hadoop 2 to Hive and Redshift, built for flexibility, experimentation, iteration, testability, and reliability.

Randy Shoup
on Jul 19, 2014

Icon

51:04
REEF: Retainable Evaluator Execution Framework

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

Rusty Sears
on Dec 10, 2013

Icon

38:11
Apache Tez: Accelerating Hadoop Query Processing

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

Arun Murthy Bikas Saha
on Dec 05, 2013

Icon

38:16
Big Data Platform as a Service at Netflix

Jeff Magnusson details some of Netflix' key services: Franklin, Sting and Lipstick.

Jeff Magnusson
on Nov 18, 2013

Icon

52:24
Petabyte Scale Data at Facebook

Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.

Dhruba Borthakur
on Dec 17, 2012

Icon

49:37
Hadoop and Cassandra, Sitting in a Tree ...

Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.

Jake Luciani
on May 30, 2012

Icon

45:41

Unlock the full InfoQ experience

Don't have an InfoQ account?

Topics

Expanding Swift from Apps to Services

You’ve Generated Your MVP Using AI. What Does That Mean for Your Software Architecture?

Building Embedding Models for Large-Scale Real-World Applications

Beyond Code: How Engineers Need to Evolve in the AI Era

From Alert Fatigue to Agent-Assisted Intelligent Observability

Helpful links

Choose your language

Presentations

Data Science in the Cloud @StitchFix

Streaming Live Data and the Hadoop Ecosystem

Achieving Mega-Scale Business Intelligence through Speed of Thought Analytics on Hadoop

The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE

REEF: Retainable Evaluator Execution Framework

Apache Tez: Accelerating Hadoop Query Processing

Big Data Platform as a Service at Netflix

Petabyte Scale Data at Facebook

Hadoop and Cassandra, Sitting in a Tree ...

WhatsApp Deploys Rust-Based Media Parser to Block Malware on 3 Billion Devices

GitHub Copilot SDK Lets Developers Integrate Copilot CLI's Engine into Apps

How CNAME Ordering in RFC Specs Caused Cloudflare 1.1.1.1 Outage

OpenAI Scales Single Primary PostgreSQL Instance to Millions of Queries per Second for ChatGPT

You’ve Generated Your MVP Using AI. What Does That Mean for Your Software Architecture?

[Video Podcast] The Craft of Software Architecture in the Age of AI Tools

Beyond Code: How Engineers Need to Evolve in the AI Era

Creating Impactful Teams through Diversity Using Session 0

Scaling to 100+ as a Director: Lessons From Growing Engineering Organizations

Sixteen Claude Agents Built a C Compiler Without Human Intervention... Almost

Building Embedding Models for Large-Scale Real-World Applications

VillageSQL Launches as an Extension-Focused MySQL Fork

From Paging to Postmortem: Google Cloud SREs on Using Gemini CLI for Outage Response

Teleport Launches Agentic Identity Framework to Secure AI Agents Across Enterprise Infrastructure

Kubernetes Drives AI Expansion as Cultural Shift Becomes Critical

QCon London

QCon AI Boston

QCon San Francisco

InfoQ Software Architects' Newsletter

Presentations