Purdue CS592: Databases for AI: Vector Databases
(Spring 2026)




Course Description

Vector databases have recently emerged as a hot topic in the broader realm of databases for AI. The surge of interest is largely fueled by large language models (LLMs), where vector databases help overcome inherent limitations such as hallucinations, lack of domain expertise, and the inability to incorporate real-time information. This is enabled by the new paradigm of Retrieval-Augmented Generation (RAG), in which vector databases act as external knowledge bases, delivering relevant context to LLMs via vector search. While vector search itself is not new, modern vector databases face a host of system-level challenges, which we will explore in depth in this course.

In this seminar, we will first cover the fundamentals of vector databases. Then we will feature a series of invited talks and student presentations covering recent advances in vector databases. By the end of the course, students are expected to gain a solid understanding of the challenges, the state-of-the-art techniques, and the open problems in vector databases.
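As a rough illustration of the retrieval step in RAG described above, the following minimal sketch runs a brute-force vector search and places the retrieved text into an LLM prompt. It rests on assumptions that are not part of the course material: the three-dimensional embeddings are made up by hand rather than produced by a real embedding model, and the helper names `cosine_similarity` and `top_k` are hypothetical. Real vector databases typically replace the linear scan with an approximate index.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query, corpus, k=2):
    """Return the texts of the k corpus entries most similar to the query.

    This is an exact linear scan over every stored vector, used here only
    for illustration.
    """
    scored = [(cosine_similarity(query, vec), text) for text, vec in corpus]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]


# Toy knowledge base: (text, embedding). The embeddings are hand-written
# placeholders, not the output of any real model.
corpus = [
    ("Purdue is in West Lafayette.", [0.9, 0.1, 0.0]),
    ("HNSW is a graph-based index.", [0.1, 0.9, 0.2]),
    ("IVF partitions vectors into clusters.", [0.0, 0.8, 0.5]),
]

# Hypothetical embedding of the user question below.
query_vec = [0.05, 0.95, 0.3]
question = "How do vector indexes work?"

# Retrieve context, then build the prompt that would be sent to an LLM.
context = top_k(query_vec, corpus, k=2)
prompt = "Context:\n" + "\n".join(context) + "\n\nQuestion: " + question
print(prompt)
```

The retrieved passages are simply prepended to the question, so the model answers with the external knowledge in view rather than from its parameters alone.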




Instructor




Logistics

  • When: MW 4:30-5:45pm
  • Where: GRIS 133
  • Office hours: after class or by appointment
  • Prerequisites: No prior experience with vector databases is required. However, familiarity with data structures (e.g., CS251), databases (e.g., CS348 or CS448), and introductory AI/ML (e.g., CS242 or CS243) will be helpful.



Online communications

  • We'll use Piazza for announcements, discussions, and Q&A.
  • We'll NOT use Brightspace, except to send occasional emails.



Schedule (More Speakers to Be Invited)

Lecture          Talk/Paper Title                   Presenter
Lec 01 (01/12)   Introduction to Vector Databases   Jianguo Wang
Lec 02 (01/14)   Introduction to Vector Databases   Jianguo Wang
Lec 03 (01/19)   Cancelled for business trip        N/A
Lec 04 (01/21)   Cancelled for business trip        N/A
Lec 05 (01/26)   Introduction to Vector Databases   Jianguo Wang
Lec 06 (01/28)   Vectorchord DB                     Allen Zhou @ Vectorchord
Lec 07 (02/02)   TBD                                TBD
Lec 08 (02/04)   pgvector                           Jonathan Katz @ Databricks
Lec 09 (02/09)   TBD                                TBD
Lec 10 (02/11)   TBD                                TBD
Lec 11 (02/16)   TBD                                TBD
Lec 12 (02/18)   TBD                                TBD
Lec 13 (02/23)   TBD                                TBD
Lec 14 (02/25)   TBD                                TBD
Lec 15 (03/02)   TBD                                TBD
Lec 16 (03/04)   TBD                                TBD
Lec 17 (03/09)   TBD                                TBD
Lec 18 (03/11)   TBD                                TBD
Lec 19 (03/16)   Cancelled for Spring break         N/A
Lec 20 (03/18)   TBD                                TBD
Lec 21 (03/23)   TBD                                TBD
Lec 22 (03/25)   MongoDB Vector Search              Chunbin Lin @ MongoDB
Lec 23 (03/30)   TBD                                TBD
Lec 24 (04/01)   TBD                                TBD
Lec 25 (04/06)   TBD                                TBD
Lec 26 (04/08)   TBD                                TBD
Lec 27 (04/13)   Cancelled for business trip        N/A
Lec 28 (04/15)   Cancelled for business trip        N/A
Lec 29 (04/20)   TBD                                TBD
Lec 30 (04/22)   TBD                                TBD
Lec 31 (04/27)   TBD                                TBD
Lec 32 (04/29)   ByteDance Vector Search            Silu Huang @ ByteDance