InfoQ Homepage Big Data Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

Robust Foundation for Data Pipelines at Scale - Lessons from Netflix

Jun He and Harrington Joseph share their experiences of building and operating the orchestration platform for Netflix’s big data ecosystem.

Jun He Harrington Joseph
on Dec 16, 2021

Icon

38:17
Culture & Methods

Privacy Architecture for Data-Driven Innovation

Nishant Bhajaria discusses how to set up a privacy program and shares tips on how to influence engineering and other teams to own their data and its usage so that privacy is a shared goal.

Nishant Bhajaria
on Apr 23, 2020

Icon

51:49
AI, ML & Data Engineering

What Does It Mean to Be a Data Scientist? Definitions and Lessons Learned from the Trenches

Brian Korzynski discusses what Data Science and Big Data are, focusing on the data preparation that needs to take place, and making a distinction between ML issues and programming.

Brian Korzynski
on Mar 29, 2020

Icon

52:56
AI, ML & Data Engineering

Big Data Legal Issues. GDPR and Contracts

Anton Tarasiuk discusses the legal issues that can be encountered when dealing with Big Data, GDPR and contracts.

Anton Tarasiuk
on Mar 27, 2020

Icon

19:53
AI, ML & Data Engineering

Big Data's Ethical Drought: The Thirst for More Data Has Led to a Lapse in Ethics and Privacy

Katharine Jarmul provides examples of data (mis)use and asking how we can work with data without violating the trust and privacy of users, producing an ethical product?

Katharine Jarmul
on Oct 17, 2019

Icon

53:00
AI, ML & Data Engineering

Putting the Spark in Functional Fashion Tech Analytics

Gareth Rogers shows how his team used Clojure to provide a solid platform to connect and manage an AWS hosted analytics pipeline and the pitfalls they encountered on the way.

Gareth Rogers
on Jul 30, 2019

Icon

35:50
AI, ML & Data Engineering

Apache Metron in the Real World – Big Data and Cybersecurity, a Perfect Match

Dave Russell takes a look at a number of different organizations who are on their big data cybersecurity journey with Apache Metron.

Dave Russell
on Jun 18, 2019

Icon

35:59
AI, ML & Data Engineering

Petastorm: A Light-Weight Approach to Building ML Pipelines

Yevgeni Litvin describes how Petastorm facilitates tighter integration between Big Data and Deep Learning worlds, simplifies data management and data pipelines, and speeds up model experimentation.

Yevgeni Litvin
on Jun 11, 2019

Icon

42:45
AI, ML & Data Engineering

People You May Know: Fast Recommendations over Massive Data

Sumit Rangwala and Felix GV present the evolution of PYMK’s architecture, focusing on Gaia, a real-time graph computing capability, and Venice, an online feature store with scoring capability.

Sumit Rangwala Felix GV
on Jun 05, 2019

Icon

39:12
AI, ML & Data Engineering

Productionizing H2O Models with Apache Spark

Jakub Hava demonstrates the creation of pipelines integrating H2O machine learning models and their deployments using Scala or Python.

Jakub Hava
on May 09, 2019

Icon

34:50
AI, ML & Data Engineering

Winning Ways for Your Visualization Plays

Mark Grundland explores practical techniques for information visualization design to take better account of the fundamental limitations of visual perception.

Mark Grundland
on Apr 15, 2019

Icon

38:56
Architecture & Design

Migrating from Big Data Architecture to Spring Cloud

Lenny Jaramillo discusses how Northern Trust migrated to PCF, highlighting how this helped them accelerate the delivery of functionality to their customers.

Lenny Jaramillo
on Jan 04, 2019

Icon

29:45

Newer Presentations

Older Presentations

Topics

Scaling Challenges: Productivity, Cost Efficiency, and Microservice Management

Cell-Based Architecture

Developer Experience in the Age of Generative AI

Platform Engineering – Making Other Teams 10x Better

Pulumi Adventures: How Python Empowered My Infrastructure beyond YAML

Helpful links

Choose your language

Presentations

Robust Foundation for Data Pipelines at Scale - Lessons from Netflix

Privacy Architecture for Data-Driven Innovation

What Does It Mean to Be a Data Scientist? Definitions and Lessons Learned from the Trenches

Big Data Legal Issues. GDPR and Contracts

Big Data's Ethical Drought: The Thirst for More Data Has Led to a Lapse in Ethics and Privacy

Putting the Spark in Functional Fashion Tech Analytics

Apache Metron in the Real World – Big Data and Cybersecurity, a Perfect Match

Petastorm: A Light-Weight Approach to Building ML Pipelines

People You May Know: Fast Recommendations over Massive Data

Productionizing H2O Models with Apache Spark

Winning Ways for Your Visualization Plays

Migrating from Big Data Architecture to Spring Cloud

QCon London: Netflix Saves Time and Money with Server-Driven Notifications

CrowdStrike Update Bricks Estimated 8.5M Windows Machines Worldwide

Scaling Challenges: Productivity, Cost Efficiency, and Microservice Management

Navigating Software Architecture at Scale: Insights from Decathlon’s Architecture Process

Cell-Based Architecture

Queue Support for Apache Kafka: KIP-932 and KMQ from SoftwareMill

Platform Engineering – Making Other Teams 10x Better

How Team Health Checks Help Software Teams to Deliver

Managing Staff+ Engineers: Opportunities and Challenges

AWS Introduces Amazon Q Developer in SageMaker Studio to Streamline ML Workflows

OpenAI Releases GPT-4o mini Model with Improved Jailbreak Resistance

Redis Improves Performance of Vector Semantic Search with Multi-Threaded Query Engine

RADIUS Protocol Vulnerability Exposes Network Device Authentication

HashiCorp Releases Consul 1.19 with Enhanced Kubernetes and Nomad Integration

Ngrok Traffic Inspector Provides Observability for Network Traffic

InfoQ Live Roundtable

InfoQ Dev Summit Munich

QCon San Francisco

QCon London

InfoQ Software Architects' Newsletter

Login with:

Don't have an InfoQ account?

Presentations