InfoQ Homepage Database Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

In-Process Analytical Data Management with DuckDB

Hannes Mühleisen discusses DuckDB, an analytical data management system that is built for an in-process use case. DuckDB speaks SQL, is integrated as a library, and uses query processing techniques.

Hannes Mühleisen
on Feb 28, 2024

Icon

28:11
AI, ML & Data Engineering

PRQL: a Simple, Powerful, Pipelined SQL Replacement

Aljaž Mur Eržen discusses PRQL, a language that can be compiled to most SQL dialects, which makes it portable and reusable, important factors of OLAP.

Aljaž Mur Eržen
on Feb 08, 2024

Icon

49:13
AI, ML & Data Engineering

Ephemeral Execution is the Future of Computing, but What about the Data?

Jerop Kipruto and Christie Warwick use Tekton to explore challenges of data gravity in ephemeral execution, discussing clean container injection mechanisms and a secure server interface.

Jerop Kipruto Christie Warwick
on Feb 06, 2024

Icon

40:17
AI, ML & Data Engineering

Amazon DynamoDB Distributed Transactions at Scale

Akshat Vig explains how transactions were added to Amazon DynamoDB using a timestamp-based ordering protocol to achieve low latency for both transactional and non-transactional operations.

Akshat Vig
on Jan 19, 2024

Icon

49:07
AI, ML & Data Engineering

Needle in a 930M Member Haystack: People Search AI @LinkedIn

Mathew Teoh explores how LinkedIn's People Search system uses ML to surface the right person that you're looking for.

Mathew Teoh
on Dec 28, 2023

Icon

50:57
AI, ML & Data Engineering

PostgresML: Leveraging Postgres as a Vector Database for AI

Montana Low provides an understanding of how Postgres can be used as a vector database for AI and how it can be integrated into your existing application stack.

Montana Low
on Nov 30, 2023

Icon

48:53
AI, ML & Data Engineering

LLMs in the Real World: Structuring Text with Declarative NLP

Adam Azzam discusses why building machine learning pipelines to extract structured data from unstructured text is a popular problem within an unpopular development lifecycle.

Adam Azzam
on Nov 04, 2023

Icon

50:00
AI, ML & Data Engineering

Performance and Scale - Domain-Oriented Objects vs Tabular Data Structures

Donald Raab and Rustam Mehmandarov discuss three library solutions for managing data based on an example of high-performance CSV processing.

Donald Raab Rustam Mehmandarov
on Oct 30, 2023

Icon

43:54
AI, ML & Data Engineering

What is Derived Data? (and Do You Already Have Any?)

Felix GV explains what derived data is, and dives into four major use cases which fit in the derived data bucket, including: graphs, search, OLAP and ML feature storage.

Felix GV
on Aug 25, 2023

Icon

50:00
AI, ML & Data Engineering

Speed of Apache Pinot at the Cost of Cloud Object Storage with Tiered Storage

Neha Pawar discusses how to query data on the cloud directly with sub-seconds latencies, diving into data fetch and optimization strategies, challenges faced and learnings.

Neha Pawar
on Aug 16, 2023

Icon

43:39
AI, ML & Data Engineering

A New Era for Database Design with TigerBeetle

Joran Dirk Greef discusses pivotal moments in database design and how they influenced the design decisions for TigerBeetle, a distributed financial accounting database.

Joran Dirk Greef
on Aug 04, 2023

Icon

50:03
Cloud

Azure Cosmos DB: Low Latency and High Availability at Planet Scale

Mei-Chin Tsai and Vinod Sridharan discuss the internal architecture of Azure Cosmos DB and how it achieves high availability, low latency, and scalability.

Mei-Chin Tsai Vinod Sridharan
on Jul 15, 2023

Icon

49:23

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations