INTERACTIVE COURSES
Searching by meaning: vector databases and retrieval
An accessible university-level course on semantic search. We start from the limit of keyword search, which ignores meaning, and rebuild search by meaning: represent a text as a vector, measure proximity, index millions of vectors without comparing them all, make the index durable, marry it with the lexical world, and finally feed a language model. From the geometry of embeddings to RAG.
- 00 8 minForewordWhy search by meaning, what this course covers, and how to read it.
- 01 26 minEmbeddings and the geometry of similarityMeaning as a position in space, and three ways to measure how close two meanings are.
- 02 28 minExact search and the curse of dimensionalityComparing every vector gives the perfect answer. Here is its price, and the trap that high dimension sets for our intuition.
- 03 30 minHNSW: navigating a proximity graphWhat if finding the nearest neighbor became a short stroll of a few hops, instead of a scan across millions of vectors?
- 04 28 minThe landscape of ANN indexesFour index families, three riches you can never keep all at once: how to choose between recall, speed and memory?
- 05 26 minTesting the approximate: the differential oracleAn index can pass every test, two reviews, and still return bad results. How do you catch an algorithm that lies?