Semester: | Spring 2021, also offered on Fall 2019 and Fall 2018 |
Time and place: | Monday, Wednesday and Friday, 10.30am-11.20am EST |
Instructor: | Jean Honorio (Please send an e-mail for appointments) |
TAs: |
Kevin Bello, email: kbellome at purdue.edu, Office hours: Monday 1pm-3pm EST Prerit Gupta, email: gupta596 at purdue.edu, Office hours: Friday 2pm-4pm EST Chuyang Ke, email: cke at purdue.edu, Office hours: Tuesday 2pm-4pm EST Jin Son, email: son74 at purdue.edu, Office hours: Thursday 3pm-5pm EST Anxhelo Xhebraj, email: axhebraj at purdue.edu, Office hours: Wednesday noon-2pm EST Kaiyuan Zhang, email: zhan4057 at purdue.edu, Office hours: Tuesday 10am-noon EST |
Date | Topic (Tentative) | Notes |
Wed, Jan 20 | Lecture 1: introduction | Python |
Fri, Jan 22 | Lecture 2: probability review (joint, marginal and conditional probabilities) | |
Mon, Jan 25 |
(lecture continues) Lecture 3: statistics review (independence, maximum likelihood estimation) |
|
Wed, Jan 27 | (lecture continues) | |
Fri, Jan 29 | Lecture 4: linear algebra review |
Linear algebra in Python Homework 1: due on Feb 5, 11.59pm EST |
Mon, Feb 1 | Lecture 5: elements of data mining and machine learning algorithms | |
Wed, Feb 3 |
(lecture continues) Lecture 6: linear classification, perceptron |
|
Fri, Feb 5 | — | Homework 1 due |
Mon, Feb 8 | (lecture continues) | |
Wed, Feb 10 |
(lecture continues) Lecture 7: perceptron (convergence), support vector machines (introduction) |
|
Fri, Feb 12 | (lecture continues) | Homework 2: due on Feb 19, 11.59pm EST |
Mon, Feb 15 | (lecture continues) | |
Wed, Feb 17 | READING DAY | |
Fri, Feb 19 | Lecture 8: generative probabilistic modeling, maximum likelihood estimation, classification | Homework 2 due |
Mon, Feb 22 |
(lecture continues) Lecture 9: generative probabilistic classification (naive Bayes), non-parametric methods (nearest neighbors) |
Homework 3: due on Mar 1, 11.59pm EST |
Web, Feb 24 |
(lecture continues) Lecture 10: non-parametric methods (classification trees) |
|
Fri, Feb 26 | (lecture continues) | |
Mon, Mar 1 | Case Study 1 | Homework 3 due |
Wed, Mar 3 |
(lecture continues) Lecture 11: performance measures, cross-validation, statistical hypothesis testing |
|
Fri, Mar 5 |
(lecture continues) Lecture 12: model selection and generalization (VC dimension) |
Homework 4: due on Mar 12, 11.59pm EST |
Mon, Mar 8 | (lecture continues) | |
Wed, Mar 10 | Case Study 2 | |
Fri, Mar 12 |
(lecture continues) Lecture 13: dimensionality reduction, principal component analysis (PCA) |
Homework 4 due |
Mon, Mar 15 | (lecture continues) | |
Wed, Mar 17 | MIDTERM (lectures 1 to 12, all case studies) |
Start: Wednesday March 17, 10.30am EST End: Thursday March 18, 10.30am EST |
Fri, Mar 19 |
(lecture continues) (midterm solution) |
Homework 5: due on Mar 26, 11.59pm EST |
Mon, Mar 22 | Lecture 14: nonlinear feature mappings, kernels, kernel perceptron, kernel support vector machines | |
Wed, Mar 24 | (lecture continues) | |
Fri, Mar 26 | Lecture 15: ensemble methods: bagging, boosting, bias/variance tradeoff |
Homework 5 due Homework 6: due on Apr 2, 11.59pm EST |
Mon, Mar 29 |
(lecture continues) Case Study 3 |
Project plan due ([Word] or [Latex] format) |
Wed, Mar 31 | (lecture continues) | |
Fri, Apr 2 | — | Homework 6 due |
Mon, Apr 5 | Lecture 16: clustering, k-means, hierarchical clustering | Homework 7: due on Apr 12, 11.59pm EST |
Wed, Apr 7 |
(lecture continues) Lecture 17: clustering, mixture models, expectation-maximization (EM) algorithm |
|
Fri, Apr 9 |
(lecture continues) Lecture 18: anomaly detection, one-class support vector machines |
|
Mon, Apr 12 | (lecture continues) | Homework 7 due |
Wed, Apr 14 | Lecture 19: Bayesian networks (independence) | |
Fri, Apr 16 |
(lecture continues) Lecture 20: pattern discovery, association rules, frequent itemsets |
Preliminary project report, due on Apr 16, 11.59pm EST |
Mon, Apr 19 |
(lecture continues) Lecture 21: feature selection (univariate/multivariate, filter/wrapper/embedded methods, L1-norm regularization) |
|
Wed, Apr 21 | (lecture continues) | |
Fri, Apr 23 |
(lecture continues) Lecture 22: data quality, preprocessing, visualization, distances |
|
Mon, Apr 26 | FINAL EXAM (lectures 13 to 21, all case studies) |
Start: Monday April 26, 10.30am EST End: Tuesday April 27, 10.30am EST |
Wed, Apr 28 | (lecture continues) | |
Fri, Apr 30 | (final exam solution) | Final project report, due on Apr 30, 11.59pm EST |