InfoQ Homepage Big Data Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

Petastorm: A Light-Weight Approach to Building ML Pipelines

Yevgeni Litvin describes how Petastorm facilitates tighter integration between Big Data and Deep Learning worlds, simplifies data management and data pipelines, and speeds up model experimentation.

Yevgeni Litvin
on Jun 11, 2019

Icon

42:45
AI, ML & Data Engineering

People You May Know: Fast Recommendations over Massive Data

Sumit Rangwala and Felix GV present the evolution of PYMK’s architecture, focusing on Gaia, a real-time graph computing capability, and Venice, an online feature store with scoring capability.

Sumit Rangwala Felix GV
on Jun 05, 2019

Icon

39:12
AI, ML & Data Engineering

Productionizing H2O Models with Apache Spark

Jakub Hava demonstrates the creation of pipelines integrating H2O machine learning models and their deployments using Scala or Python.

Jakub Hava
on May 09, 2019

Icon

34:50
AI, ML & Data Engineering

Winning Ways for Your Visualization Plays

Mark Grundland explores practical techniques for information visualization design to take better account of the fundamental limitations of visual perception.

Mark Grundland
on Apr 15, 2019

Icon

38:56
Architecture & Design

Migrating from Big Data Architecture to Spring Cloud

Lenny Jaramillo discusses how Northern Trust migrated to PCF, highlighting how this helped them accelerate the delivery of functionality to their customers.

Lenny Jaramillo
on Jan 04, 2019

Icon

29:45
AI, ML & Data Engineering

Using Data Effectively: beyond Art and Science

Hilary Parker talks about approaches and techniques to collect the most useful data, analyze it in a scientific way, and use it most effectively to drive actions and decisions.

Hilary Parker
on Nov 28, 2018

Icon

41:57
AI, ML & Data Engineering

Big Data and Deep Learning: A Tale of Two Systems

Zhenxiao Luo explains how Uber tackles data caching in large-scale DL, detailing Uber’s ML architecture and discussing how Uber uses Big Data, concluding by sharing AI use cases.

Zhenxiao Luo
on Nov 15, 2018

Icon

38:19
AI, ML & Data Engineering

Accelerated Spark on Azure: Seamless and Scalable Hardware Offloads in the Cloud

Yuval Degani shows how hardware accelerations in Azure can be utilized to speed-up Spark jobs, with the aid of RDMA (Remote Direct Memory Access) support in the VM.

Yuval Degani
on Nov 03, 2018

Icon

38:06
AI, ML & Data Engineering

Implementing AutoML Techniques at Salesforce Scale

Matthew Tovbin shows how to build ML models using AutoML (Salesforce), including techniques for automatic data processing, feature generation, model selection, hyperparameter tuning and evaluation.

Matthew Tovbin
on Oct 28, 2018

Icon

39:16
Culture & Methods

Privacy Ethics – A Big Data Problem

Raghu Gollamudi broadly covers best practices with respect to Data Management aspects from mapping Enterprise data to applying Data Protection rules like GDPR at petabyte scale.

Raghu Gollamudi
on Aug 23, 2018

Icon

35:00
Culture & Methods

What is a Data Citizen?

Caitlin McDonald discusses how big data affects people online and the ethics to be considered when dealing with data.

Caitlin McDonald
on Aug 16, 2018

Icon

14:53
Culture & Methods

When Data Kills

Cori Crider shares insights from her investigations of US drone strikes in Yemen and Pakistan, and explores how misuse of mass surveillance data has claimed innocent lives.

Cori Crider
on Aug 10, 2018

Icon

25:49

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations