Talks
Invited talks and presentations
-
1
Towards Efficient Generative Large Language Model Serving: A Tutorial from Algorithms to Systems
- ICML Tutorial, Vienna, Austria, July 2024
-
2
Demystifying Data Management for Large Language Models
- SIGMOD, Santiago, Chile, June 2024
-
3
Toward Fast and Affordable Serving Systems for Large Language Models
- XTensor@ASPLOS, San Diego, USA, April 2024
- MLSys YPS, Santa Clara, USA, May 2024
- WAIC, Shanghai, China, July 2024
- Faster Inference of LLMs Seminar, Online, August 2024
-
4
SpotServe: Serving Generative Large Language Models on Preemptible Instances
- ChinaSys Fall, Online, December 2023
- ASPLOS, San Diego, USA, April 2024
-
5
SpecInfer: Accelerating Generative LLM Serving with Tree-based Speculative Inference and Token Verification.
- Microsoft Azure AI Talk, Online, November 2023
- ASPLOS, San Diego, USA, April 2024
-
6
Recent Advances in Data-Centric MLSys: A DBer's Perspective.
- Tencent DB-Talk, Online, August 2023
-
8
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism.
- ChinaSys Fall, Online, China, December 2022
- Jiqizhixin, Online, China, January 2023
- VLDB, Online, Canada, September 2023
-
9
Parcae: Proactive, Liveput-Optimized DNN Training on Preemptible Instances.
- Amazon Research Awards (ARA) Tech Talk, Online, USA, May 2023
-
10
When Sparsity Meets Distributed DL System: Efficient and Scalable Huge Embedding Model Training.
- Catalyst Group Meeting, Pittsburgh, USA, October 2022
- Tencent, Online, China, September 2022
- Baidu, OPPO, MetaX, Online, China, April 2022
- Jiqizhixin, Online, China, January 2022
-
11
Hetu: An Automatic Parallel Distributed Deep Learning Framework for Huge Model.
- Huawei Cloud InnovWave Talk, Online, China, April 2023
- CCF TCDB & Gauss 松鼠会, Online, China, April 2023
- BAAI Conference, Beijing, China, June 2022
- MSRA, Beijing, China, November 2021
- NDBC, Kunming, China, December 2019
-
13
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework.
- VLDB, Sydney, Australia, September 2022
- ChinaSys Winter, Xiamen, China, December 2021
- Huawei, Alibaba, ByteDance, October 2021
-
14
Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce.
- SIGMOD, Xiaan, China, June 2021
-
15
DeGNN: Improving Graph Neural Networks with Graph Decomposition.
- SIGKDD, Online, August 2021