Purdue CS592: Databases for AI: Vector Databases
|
Course DescriptionVector databases have recently emerged as a hot topic in the broader realm of databases for AI. The surge of interest is largely fueled by large language models (LLMs), where vector databases help overcome inherent limitations such as hallucinations, lack of domain expertise, and the inability to incorporate real-time information. This is enabled by the new paradigm of Retrieval-Augmented Generation (RAG), in which vector databases act as external knowledge bases, delivering relevant context to LLMs via vector search. While vector search itself is not new, modern vector databases face a host of system-level challenges, which we will explore in depth in this course. In this seminar, we will first cover the fundamentals of vector databases. Then we will feature a series of invited talks and student presentations covering recent advances in vector databases. By the end of the course, students are expected to gain a solid understanding of the challenges, the state-of-the-art techniques, and the open problems in vector databases. |
Instructor
|
Logistics
|
Online communications
|
Schedule (More Speakers to Be Invited) |
Lecture |
Talk/Paper Title |
Presenter |
| Lec 01 (01/12) | Introduction on Vector Databases |
Jianguo Wang |
| Lec 02 (01/14) | Introduction on Vector Databases |
Jianguo Wang |
| Lec 03 (01/19) | Cancelled for business trip |
N/A |
| Lec 04 (01/21) | Cancelled for business trip |
N/A |
| Lec 05 (01/26) | Introduction on Vector Databases |
Jianguo Wang |
| Lec 06 (01/28) | Vectorchord DB |
Allen Zhou @ Vectorchord |
| Lec 07 (02/02) | Elasticsearch Vector Search |
Mayya Sharipova @ Elastic |
| Lec 08 (02/04) | pgvector |
Jonathan Katz @ Databricks |
| Lec 09 (02/09) | Vector DBs at Microsoft |
Harsha Simhadri @ Microsoft |
| Lec 10 (02/11) | Vector DBs at Microsoft |
Qi Chen @ Microsoft |
| Lec 11 (02/16) | MyVector |
Alkin Tezuysal @ Altinity |
| Lec 12 (02/18) | RAG on Financial Documents |
Xinyu Wang @ McGill University |
| Lec 13 (02/23) | RetroInfer Vector Store for LLM Inference |
Baotong Lu @ Microsoft |
| Lec 14 (02/25) | Chroma Vector Database |
Tanuj Nayak @ Chroma |
| Lec 15 (03/02) | RaBitQ |
Cheng Long @ NTU |
| Lec 16 (03/04) | TBD |
TBD |
| Lec 17 (03/09) | TBD |
TBD |
| Lec 18 (03/11) | Serverless Vector Database |
Daniel Barcelona-Pons @ URV |
| Lec 19 (03/16) | Cancelled for Spring break |
N/A |
| Lec 20 (03/18) | TBD |
TBD |
| Lec 21 (03/23) | TBD |
TBD |
| Lec 22 (03/25) | MongoDB Vector Search |
Chunbin Lin @ MongoDB |
| Lec 23 (03/30) | TBD |
TBD |
| Lec 24 (04/01) | TBD |
TBD |
| Lec 25 (04/06) | TBD |
TBD |
| Lec 26 (04/08) | TBD |
TBD |
| Lec 27 (04/13) | Cancelled for business trip |
N/A |
| Lec 28 (04/15) | Cancelled for business trip |
N/A |
| Lec 29 (04/20) | TBD |
TBD |
| Lec 30 (04/22) | TBD |
TBD |
| Lec 31 (04/27) | TBD |
TBD |
| Lec 32 (04/29) | ByteDance Vector Search |
Silu Huang @ ByteDance |