This course focuses on the core principles of RL. As in statistical learning, a central challenge of RL is to generalize learned capabilities to unseen environments. However, RL faces additional challenges, such as the exploration-exploitation tradeoff, credit assignment, and the distribution mismatch between behavior and target policies. Throughout the course, we will examine various solutions to these challenges and provide theoretical justifications for them.