Math 535 Advanced Statistical Learning. Spring 2025.
This course is a 4-credit course, which means that in addition to the scheduled lectures/discussions, students are expected to do at least 9.5 hours of course-related work each week during the semester. This includes things like: completing assigned readings, participating in lab sessions, studying for tests and examinations, preparing written assignments, completing internship or clinical placement requirements, and other tasks that must be completed to earn credit in the course. Prerequisite
DescriptionThis course is a survey of statistical learning and data mining methods. It will cover major statistical learning methods and concepts for both supervised and unsupervised learning. Topics covered include regression methods with sparsity or other regularizations, model selection, graphical models, statistical learning pipeline and best practice, introduction to classification, including discriminant analysis, logistic regression, support vector machines, and kernel methods, nonlinear methods, dimension reduction, including matrix factorization-based approaches - principal component analysis and non-negative matrix factorization-, multidimensional scaling, and independent component analysis, clustering, decision trees, random forest, boosting and ensemble learning. List of Topics
Learning OutcomesStudents will learn how and when to apply statistical learning techniques, their comparative strengths and weaknesses, and how to critically evaluate the performance of learning algorithms. Students completing this course should be able to
Recommended Texts
PiazzaPlease use Piazza (www.piazza.com) for all communications with me rather than email. Piazza is a question-and-answer platform. It supports LaTeX, code formatting, embedding of images, and attaching of files. You are encouraged to ask questions when you have difficulty understanding a concept or working around a piece of code – you can even ask questions anonymously. Moreover, you can also answer questions from your classmates. I constantly monitor the answers and endorse those good answers. Announcement will be sent to the class using Piazza. GradescopeWe will use Gradescope to submit and grade homework. This will allow the instructor to efficient grade all the work and give feedback in a timely manner. BrightspaceBrightspace will only be used for recording grades on assignments and exams and for distributing solutions. The code and lecture notes can also be found on blackboard. Grading
Homework PolicyThere will be a deduction of 15% of the grade for each day homeworks are late (the final grade for a late homework that is N days late will be 0.85^N times the real grade). Homeworks may be discussed with classmates but must be written and submitted individually. Final ProjectStudents will compete against each other in a the Final Project. It can be completed in teams of 2 – 4 members. Grades will be based upon a progress report and a final report (one per team) as well as the contest results. Further details about the Final Project along with specific grading criteria will be given in a separate document and discussed in class. SoftwareRStudio and Google Colab will be used for completion of the homework assignments. |