InfoQ Homepage Database Content on InfoQ

Presentations

RSS Feed

Newer Older

AI, ML & Data Engineering

Amazon DynamoDB Distributed Transactions at Scale

Akshat Vig explains how transactions were added to Amazon DynamoDB using a timestamp-based ordering protocol to achieve low latency for both transactional and non-transactional operations.

Akshat Vig
on Jan 19, 2024

Icon

49:07
AI, ML & Data Engineering

Needle in a 930M Member Haystack: People Search AI @LinkedIn

Mathew Teoh explores how LinkedIn's People Search system uses ML to surface the right person that you're looking for.

Mathew Teoh
on Dec 28, 2023

Icon

50:57
AI, ML & Data Engineering

PostgresML: Leveraging Postgres as a Vector Database for AI

Montana Low provides an understanding of how Postgres can be used as a vector database for AI and how it can be integrated into your existing application stack.

Montana Low
on Nov 30, 2023

Icon

48:53
AI, ML & Data Engineering

LLMs in the Real World: Structuring Text with Declarative NLP

Adam Azzam discusses why building machine learning pipelines to extract structured data from unstructured text is a popular problem within an unpopular development lifecycle.

Adam Azzam
on Nov 04, 2023

Icon

50:00
AI, ML & Data Engineering

Performance and Scale - Domain-Oriented Objects vs Tabular Data Structures

Donald Raab and Rustam Mehmandarov discuss three library solutions for managing data based on an example of high-performance CSV processing.

Donald Raab Rustam Mehmandarov
on Oct 30, 2023

Icon

43:54
AI, ML & Data Engineering

What is Derived Data? (and Do You Already Have Any?)

Felix GV explains what derived data is, and dives into four major use cases which fit in the derived data bucket, including: graphs, search, OLAP and ML feature storage.

Felix GV
on Aug 25, 2023

Icon

50:00
AI, ML & Data Engineering

Speed of Apache Pinot at the Cost of Cloud Object Storage with Tiered Storage

Neha Pawar discusses how to query data on the cloud directly with sub-seconds latencies, diving into data fetch and optimization strategies, challenges faced and learnings.

Neha Pawar
on Aug 16, 2023

Icon

43:39
AI, ML & Data Engineering

A New Era for Database Design with TigerBeetle

Joran Dirk Greef discusses pivotal moments in database design and how they influenced the design decisions for TigerBeetle, a distributed financial accounting database.

Joran Dirk Greef
on Aug 04, 2023

Icon

50:03
Cloud

Azure Cosmos DB: Low Latency and High Availability at Planet Scale

Mei-Chin Tsai and Vinod Sridharan discuss the internal architecture of Azure Cosmos DB and how it achieves high availability, low latency, and scalability.

Mei-Chin Tsai Vinod Sridharan
on Jul 15, 2023

Icon

49:23
Cloud

Amazon DynamoDB: Evolution of a Hyperscale Cloud Database Service

Akshat Vig presents Amazon’s experience operating DynamoDB at scale and how the architecture continues to evolve to meet the ever-increasing demands of customer workloads.

Akshat Vig
on Jun 16, 2023

Icon

52:54
AI, ML & Data Engineering

How Do You Distribute Your Database over Hundreds of Edge Locations?

Erwin van der Koogh explains a new model that Cloudflare has developed to distribute a database over hundreds of locations, and where it could go next.

Erwin van der Koogh
on Feb 10, 2022

Icon

39:14
AI, ML & Data Engineering

Robust Foundation for Data Pipelines at Scale - Lessons from Netflix

Jun He and Harrington Joseph share their experiences of building and operating the orchestration platform for Netflix’s big data ecosystem.

Jun He Harrington Joseph
on Dec 16, 2021

Icon

38:17

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations