This is the course website for EE 510, Machine Learning Theory & Algorithms, Spring 2022 quarter.
Meeting time: Tue/Thu 4:40-6:30PM, FAB 150
Office hours: Tue/Thu 6:30-7:30PM (or by appointment), FAB 160-19
Course Description
The goal of this course is to provide a thorough understanding of the fundamental methodologies and algorithms used in machine learning. Students will learn to understand, implement, and innovate on algorithms for common tasks such as classification, regression, clustering, and dimensionality reduction. Algorithms covered include linear and nonlinear regression, ensemble methods, support vector machines, K-means, hierarchical clustering, and Gaussian mixture models. Theory covered will include empirical risk minimization, generalization bounds, bias-variance tradeoff, PAC learning, and VC dimension.
Students should have a firm understanding of linear algebra and probability, at the level of EE 516: Mathematical Foundations of Machine Learning and EE 520: Random Processes, respectively. Students should also be comfortable with programming in Python.
Textbook: The course will utilize the free textbooks below.
Syllabus: The syllabus can be found here.
Communication: All written questions should be posted to the appropriate channel on the Slack workspace (see Homework 0).
Course Schedule
This course will practice a flipped classroom. You are required to do the readings from Understanding Machine Learning (UML) below before the corresponding class time. Class time will be used as discussion of topics that students found most confusing or difficult. Readings from The Elements of Statistical Learning (ESL) are optional, and assignments are typically due Fridays at 11:59PM.
Date | Lecture | Topic | UML Pages | ESL Pages | Assignment Due (Friday) |
---|---|---|---|---|---|
3/29 | 1 | introduction, empirical risk minimization | 19-29, 33-41 | 9-38 | — |
3/31 | 2 | PAC learning | 43-50 | — | HW0 |
4/5 | 3 | uniform convergence | 54-58 | — | — |
4/7 | 4 | nearest neighbor rules | 258-265 | 463-480 | HW1 |
4/12 | 5 | bias-complexity tradeoff | 60-66 | 219-229 | — |
4/14 | 6 | VC-dimension | 67-78 | 237-240 | HW2 |
4/19 | 7 | (catch up) | — | — | — |
4/21 | 8 | boosting | 130-142 | 337-380 | MP1 |
4/26 | 9 | (catch up) | — | — | — |
4/28 | 10 | decision trees | 250-256 | 295-334 | HW3 |
5/3 | 11 | model selection | 144-154 | 241-256 | — |
5/5 | 12 | convex learning | 156-169 | — | HW4 |
5/10 | 13 | (catch up) | — | — | — |
5/12 | 14 | regularization & stability | 171-181 | 61-72 | MP2 |
5/17 | 15 | support vector machines | 202-213 | 417-438 | — |
5/19 | 16 | (catch up) | — | — | HW5 |
5/24 | 17 | kernels | 215-224 | 417-438 | — |
5/26 | 18 | NO CLASS | — | — | HW6 |
5/31 | 19 | kernels | 342-355 | 261-270 | — |
6/2 | 20 | clustering and generative models | 307-320 | 501-527 | MP3 |
Assignments
All assignments must be submitted via gradescope to obtain credit. See Homework 0 below for information on how to set up an account.
I provide the \(\LaTeX\) file used to generate each homework below. You must use this as a template to receive extra credit.
- Homework 0, Due: April 1, 2022 (pdf) (tex)
- Homework 1, Due: April 8, 2022 (pdf) (tex) (files)
- Homework 2, Due: April 15, 2022 (pdf) (tex) (files)
- Mini Project 1, Due: April 22, 2022 (description) (tex)
- Homework 3, Due: April 29, 2022 (pdf) (tex) (files)
- Homework 4, Due: May 6, 2022 (pdf) (tex) (files)
- Mini Project 2, Due: May 13, 2022 (description) (tex)
- Homework 5, Due: May 20, 2022 (pdf) (tex)
- Homework 6, Due: May 27, 2022 (pdf) (tex) (files)
- Mini Project 3, Due: June 10, 2022 (description) (tex)
Resources
- \(\LaTeX\): The best way to learn is to hack examples, like those I provide for the homework assignments above. A few other good resources are below.
- tutorial
- Learn LaTeX in 30 minutes
- wikibook
- LaTeX math symbols
- Overleaf: An online LaTeX editor with a Google Docs flavor
- An easy way to include code with your \(\LaTeX\) file is via the pdfpages package.
- Technical Resources: The below may be helpful resources.
- Python Resources: If you choose to code in Python, you should install Python 3 (latest stable version) using the Anaconda distribution. I also recommend making use of Jupyter notebooks, but this is not required.