SessionTopicsMaterials
Week 1
1
Monday
Mar 09
  • Course Overview
  • Introduction to Reinforcement Learning
  • Syllabus

    2
    Tuesday
    Mar 10
  • Lab: Cross-Entropy Method

  • Linux Installation DUE Wednesday 03/11/26 11:55 pm Submit Here
    3
    Thursday
    Mar 12
  • Lab: Cross-Entropy Method (cnt.)
  • Instruction
    Lab Submission Guideline
    4
    Friday
    Mar 13
  • Markov Reward Process
  • Bellman Equation

  • MRP Coding Activity
    Lab CEM DUE Friday 03/13/26 11:55 pm Submit Here
    Week 2
    5
    Monday
    Mar 16
  • Markov Decision Process

  • MDP Coding Activity
    6
    Tuesday
    Mar 17
  • Dynamic Programming
  • Value Iteration
  • Policy Iteration

  • 7
    Thursday
    Mar 19
  • Demo: Policy Iteration

  • 8
    Friday
    Mar 20
  • Lab: Value Iteration
  • Instruction
    Lab Value Iteration DUE Friday 03/20/26 11:55 pm Submit Here
    Week 3
    9
    Monday
    Mar 23
  • Model-free Prediction and Control
  • N-step
  • MC and TD

  • 10
    Tuesday
    Mar 24
  • Lab/Demo: SARSA
  • Instruction
    11
    Thursday
    Mar 26
  • Lab/Demo: SARSA (continue)
  • Lab Sarsa DUE Thursday 03/26/26 11:55 pm Submit Here
    12
    Friday
    Mar 27
  • Deep Learning in One Day

  • Week 4
    13
    Monday
    Mar 30
  • BackProp Algorithms

  • A Great Article on BP
    14
    Tuesday
    Mar 31
  • PyTorch Tutorial
  • NN Regression Code
    15
    Thursday
    Apr 02
  • Lab: PyTorch Lab
  • Instruction
    Regression
    16
    Friday
    Apr 03
  • Lab/Demo: PyTorch Lab
  • Train a Classifier
    Lab Pytorch DUE Friday 04/03/26 11:55 pm Submit Here
    Week 5
    17
    Monday
    Apr 06
  • Q learning
  • On-policy v.s. Off-policy

  • Q learning for FrozenLake
    18
    Tuesday
    Apr 07
  • DQN

  • DQN Code
    19
    Thursday
    Apr 09
  • Lab: DDPG
  • Instruction
    20
    Friday
    Apr 10
  • Lab: DDPG (continue)
  • Lab DDPG DUE Friday 04/10/26 11:55 pm Submit Here
    Week 6
    21
    Monday
    Apr 20
  • Policy Gradient

  • REINFORCE Code
    22
    Tuesday
    Apr 21
  • A2C

  • A2C Code
    23
    Thursday
    Apr 23
  • Lab: Enhanced A2C
  • Instruction
    24
    Friday
    Apr 24
  • Lab: Enhanced A2C (continue)
  • Lab Enhanced A2C DUE Friday 04/24/26 11:55 pm Submit Here
    Week 7
    25
    Monday
    Apr 27
  • PPO I

  • OpenAI tutorial on PPO
    26
    Tuesday
    Apr 28
  • PPO II
  • PPO Demo Code
    27
    Thursday
    Apr 30
  • Lab: SAC
  • Instruction
    You need to start preparing Paper Reading/Presentation
    28
    Friday
    May 01
  • Lab: SAC (continue)
  • Lab SAC DUE Friday 05/01/26 11:55 pm Submit Here
    Week 8
    29
    Monday
    May 04
  • Model-Based RL

  • LQR Demo Code
    30
    Tuesday
    May 05
  • Model-Based RL II
  • PETS Paper
    31
    Thursday
    May 07
  • Project Prep Day

  • 32
    Friday
    May 08
  • Project Proposal (no class)
  • Each group needs to meet with me by the end of today.
    Project Proposal Meeting DUE Friday 05/08/26 11:55 pm
    Week 9
    33
    Monday
    May 11
  • Seminar: AlphaGo

  • 34
    Tuesday
    May 12
  • Project Work Time/Seminar (TBD)

  • 35
    Thursday
    May 14
  • Project Work Time

  • 36
    Friday
    May 15
  • Paper Presentation
  • Week 10
    37
    Monday
    May 18
  • Project Work Time
  • 38
    Tuesday
    May 19
  • Project Work Time
  • 39
    Thursday
    May 21
  • Final Presentation
  • Team 1 and Team 2
    40
    Friday
    May 22
  • No Class