How do vector (search) databases work? ft: turbopuffer

07/04/2025 1h 8min

Listen "How do vector (search) databases work? ft: turbopuffer"

Descargar episodio Ver en sitio original

Episode Synopsis

For memberships: join this channel as a member here:https://www.youtube.com/channel/UC_mGuY4g0mggeUGM6V1osdA/joinSummary:In this conversation, Kaivalya Apte and Simon Eskildsen talk about vector databases, particularly focusing on TurboPuffer. They discuss the importance of vector search, embeddings, and the challenges associated with building efficient search engines. The conversation covers various aspects such as cost considerations, chunking strategies, multi-tenancy, and performance optimization. Simon shares insights on the future of vector search and the significance of observability and metrics in database performance. The discussion emphasizes the need for practical application and experimentation in understanding these technologies.Chapters:00:00 Introduction to Vector Databases10:34 Understanding Vectors and Embeddings15:03 Example: Designing a Search Engine for Podcasts27:53 Scaling Challenges in Vector Search36:46 Indexing and Querying in TurboPuffer38:12 Understanding Indexing and Query Planning45:45 Exploring Index Types and Their Performance50:27 Data Ingestion and Embedding Retrieval54:19 Use Cases and Challenges in Vector Search01:01:22 Metrics and Observability in Vector Databases01:03:52 Future Trends in Vector Search and DatabasesReferences:How do build a database on Object Storage? https://youtu.be/RFmajOeUKnETurbopuffer https://turbopuffer.com/Continous Recall measurement: https://turbopuffer.com/blog/continuous-recallTurbopuffer architecture: https://turbopuffer.com/architecture

More episodes of the podcast The GeekNarrator

Databases and Engineering with @PlanetScale CEO - Sam Lambert 16/11/2025

What is TigerStyle? Principles behind TigerBeetle ft. Joran 16/11/2025

What makes Apache Pinot so Fast? 16/11/2025

You don't need Linux, Docker, k8s? Future with Unikernels ft. NanoVMs 25/10/2025

Modern, ultra fast PostgreSQL engineered from scratch? ft: CedarDB 25/10/2025

Building a new Database Query Optimiser - @cmu 29/07/2025

Fast Observability on S3 with Parseable 29/07/2025

How does AWS Lambda work? 29/07/2025

Breaking Distributed Systems with Kyle Kingsbury from Jepsen 29/07/2025

Are your Data Pipelines Complex? 07/04/2025

Ver todos los episodios

ZARZA We are Zarza, the prestigious firm behind major projects in information technology.

How do vector (search) databases work? ft: turbopuffer

Listen "How do vector (search) databases work? ft: turbopuffer"

Episode Synopsis

More episodes of the podcast The GeekNarrator

Internet as human right and its scope

Digital Natives: Children of today, Technologists of Tomorrow

Bandwidth: Broadband or Narrowband?

Personnel recruitment via Web

Deep web or Invisible Internet

Subdomains, a glance with the experts!

Free Internet, a prediction in Nostradamus style

Educational Technology: From traditional to digital

Localhost, there’s no place like 127.0.0.1

Googling with breathtaking tricks you ignore

Gray Hat Hacking, those with ambiguous ethics…

Internet Predators on the prowl

Dot COM: The Internet’s dominant TLD