InfoQ Homepage Data Content on InfoQ

Presentations

RSS Feed

Newer Older

Java

Are You Missing a Data Frame? The Power of Data Frames in Java

Vladimir Zakharov discusses the power of DataFrames in Java. He compares implementations like DataFrame-EC and Tablesaw against Python’s pandas, focusing on performance and memory efficiency.

Vladimir Zakharov
on Feb 09, 2026

Icon

49:15
AI, ML & Data Engineering

Powering Enterprise AI Applications with Data and Open Source Software

Francisco Javier Arceo explored Feast, the open-source feature store designed to address common data challenges in the AI/ML lifecycle, such as feature redundancy, and low-latency serving at scale.

Francisco Javier Arceo
on Dec 15, 2025

Icon

49:25
AI, ML & Data Engineering

Reliable Data Flows and Scalable Platforms: Tackling Key Data Challenges

Matthias Niehoff discusses bridging the gap between application and data engineering. Learn to apply software engineering best practices, embrace boring technologies, and simplify architecture.

Matthias Niehoff
on Nov 28, 2025

Icon

50:11
AI, ML & Data Engineering

Achieving Precision in AI: Retrieving the Right Data Using AI Agents

Adi Polak discusses achieving precision in GenAI by moving beyond RAG to Agentic RAG. She details agent patterns, feedback loops, and using data streaming architectures to scale real-time AI.

Adi Polak
on Nov 07, 2025

Icon

50:00
AI, ML & Data Engineering

The Data Backbone of LLM Systems

Paul Iusztin discusses the evolution of AI engineering, highlighting the shift from model training to foundational models. He shares insights on scalable LLM systems and optimizing RAG.

Paul Iusztin
on Sep 10, 2025

Icon

51:25
AI, ML & Data Engineering

Efficient Incremental Processing with Netflix Maestro and Apache Iceberg

Jun He discusses how to use an IPS to build more reliable, efficient, and scalable data pipelines, unlocking new data processing patterns.

Jun He
on Feb 07, 2025

Icon

44:32
Java

1BRC–Nerd Sniping the Java Community

Gunnar Morling discusses some of the tricks employed by the fastest solutions for processing a 13 GB input file within less than two seconds through parallelization and efficient memory access.

Gunnar Morling
on Oct 25, 2024

Icon

50:07
AI, ML & Data Engineering

Architecting for Data Products

Danilo Sato discusses what constitutes a data product and different types of data products, how data products support data architecture at different levels, skills and team topologies needed.

Danilo Sato
on Sep 19, 2024

Icon

49:18
AI, ML & Data Engineering

Incremental Data Processing with Apache Hudi

The presenters discuss an introduction to incremental data processing, contrasting it with the two prevalent processing models of today - batch and stream data processing.

Saketh Chintapalli Bhavani Sudha Saktheeswaran
on Aug 16, 2024

Icon

41:53
Architecture & Design

Understanding Architectures for Multi-Region Data Residency

Alex Strachan discusses challenges to build multi-region data storages, understanding why and when a business needs to do this, who are the real stakeholders, and who owns what.

Alex Strachan
on May 24, 2024

Icon

45:44
AI, ML & Data Engineering

Multi-Region Data Streaming with Redpanda

Michał Maślanka introduces the design of Redpanda’s Multi-Region feature, and describes how they leveraged Raft’s properties, a constraint solver, automatic data balancing, and tiered storage.

Michał Maślanka
on Mar 06, 2024

Icon

41:44
AI, ML & Data Engineering

Graph Learning at the Scale of Modern Data Warehouses

Subramanya Dulloor outlines an approach to addressing the challenges of warehouses and shows how to build an efficient and scalable end-to-end system for graph learning in data warehouses.

Subramanya Dulloor
on Feb 16, 2024

Icon

45:03

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations