CSE 599 D1 Theoretical Deep LearningSummaryInstructor: Simon S. Du This is a graduate course focused on research in theoretical aspects of deep learning. Deep learning has become the central paradigm of machine learning. However, a mathematical understanding is still lacking. How many samples are needed? How fast does training succeed? Why are convolutional neural networks suitable for image data? This course will cover recent advances in answering these questions. Announcements
PrerequisiteThis is an advanced and theory-heavy course: there is no programming assignment and students are required to work on a theory-focused course project. Students entering the class should have a working knowledge of machine learning, probability and statistics, optimization and linear algebra. Backgrounds on deep learning is a bonus but not necessary. Grading PoliciesAssignments 50% (25% each) and Project 50% (10% proposal, 15% presentation, 25% final report). HomeworkThere are two homework, which will be released on Canvas.
Timeline:
Course ProjectProjects should be done in groups of 2-3, with the intention of exploring one direction in greater detail. If you cannot find team members for a group, send an email to the instructor. The projects can be either a literature review or original research. We list ideas for a few project topics on Canvas. It is okay to do projects on topic not listed. Timeline:
Format: You must use the NeurIPS Latex format. Reference ReadingsRelated CoursesSanjeev Aora: Theoretical Deep Learning |