Purdue CS592: Databases for AI: Vector Databases
(Spring 2026)




Course Description

Vector databases have recently emerged as a hot topic in the broader realm of databases for AI. The surge of interest is largely fueled by large language models (LLMs), where vector databases help overcome inherent limitations such as hallucinations, lack of domain expertise, and the inability to incorporate real-time information. This is enabled by the new paradigm of Retrieval-Augmented Generation (RAG), in which vector databases act as external knowledge bases, delivering relevant context to LLMs via vector search. While vector search itself is not new, modern vector databases face a host of system-level challenges, which we will explore in depth in this course.

In this seminar, we will first cover the fundamentals of vector databases. Then we will feature a series of invited talks and student presentations covering recent advances in vector databases. By the end of the course, students are expected to gain a solid understanding of the challenges, the state-of-the-art techniques, and the open problems in vector databases.




Instructor




Logistics

  • When: MW 4:30-5:45pm
  • Where: GRIS 133
  • Office hour: after class or make appointment
  • Pre-requisites: No prior experience with vector databases is required. However, familiarity with data structures (e.g., CS251), databases (e.g., CS348 or CS448), and introductory AI/ML (e.g., CS242 or CS243) will be helpful.



Online communications

  • We'll use Piazza, e.g., announcements, discussions, and Q&A.
  • We'll NOT use Brightspace except for sending emails occasionally.



Schedule (More Speakers to Be Invited)

Lecture

Talk/Paper Title

Presenter

Lec 01 (01/12) Introduction on Vector Databases
Jianguo Wang
Lec 02 (01/14) Introduction on Vector Databases
Jianguo Wang
Lec 03 (01/19) Cancelled for business trip
N/A
Lec 04 (01/21) Cancelled for business trip
N/A
Lec 05 (01/26) Introduction on Vector Databases
Jianguo Wang
Lec 06 (01/28) Vectorchord DB
Allen Zhou @ Vectorchord
Lec 07 (02/02) Elasticsearch Vector Search
Mayya Sharipova @ Elastic
Lec 08 (02/04) pgvector
Jonathan Katz @ Databricks
Lec 09 (02/09) Vector DBs at Microsoft
Harsha Simhadri @ Microsoft
Lec 10 (02/11) Vector DBs at Microsoft
Qi Chen @ Microsoft
Lec 11 (02/16) MyVector
Alkin Tezuysal @ Altinity
Lec 12 (02/18) RAG on Financial Documents
Xinyu Wang @ McGill University
Lec 13 (02/23) RetroInfer Vector Store for LLM Inference
Baotong Lu @ Microsoft
Lec 14 (02/25) Chroma Vector Database
Tanuj Nayak @ Chroma
Lec 15 (03/02) RaBitQ
Cheng Long @ NTU
Lec 16 (03/04) TBD
TBD
Lec 17 (03/09) TBD
TBD
Lec 18 (03/11) Serverless Vector Database
Daniel Barcelona-Pons @ URV
Lec 19 (03/16) Cancelled for Spring break
N/A
Lec 20 (03/18) TBD
TBD
Lec 21 (03/23) TBD
TBD
Lec 22 (03/25) MongoDB Vector Search
Chunbin Lin @ MongoDB
Lec 23 (03/30) TBD
TBD
Lec 24 (04/01) TBD
TBD
Lec 25 (04/06) TBD
TBD
Lec 26 (04/08) TBD
TBD
Lec 27 (04/13) Cancelled for business trip
N/A
Lec 28 (04/15) Cancelled for business trip
N/A
Lec 29 (04/20) TBD
TBD
Lec 30 (04/22) TBD
TBD
Lec 31 (04/27) TBD
TBD
Lec 32 (04/29) ByteDance Vector Search
Silu Huang @ ByteDance