Summary of CSA 815: Reinforcement Learning

CSA 815: Reinforcement Learning

Course Purpose

This course introduces reinforcement learning, covering fundamental concepts, key algorithms such as Q-learning, SARSA, and DQN and practical solutions for complex decision-making tasks using appropriate tools and techniques.

Course Learning Outcomes

Explain the basic concepts and principles of reinforcement learning, including agents, environments, rewards, and policies.
Apply key reinforcement learning algorithms such as Q-learning, SARSA, and deep Q-networks (DQN) to solve practical problems.
Evaluate and compare reinforcement learning algorithms in terms of efficiency, convergence, and performance in real-world tasks.
Design and implement reinforcement learning solutions for complex decision-making problems using suitable tools and techniques.

Course Content

Introduction to Reinforcement Learning: key concepts of agents, environments, actions, rewards, policies, states, and value functions; followed by Foundations of Reinforcement Learning: Markov Decision Processes (MDPs), Bellman equations, reward signals, policy evaluation, policy improvement, and optimality; next, Core Reinforcement Learning Algorithms: value iteration, policy iteration, Q-learning, SARSA, Monte Carlo methods; Deep Reinforcement Learning: deep Q-networks (DQN), policy gradient methods, actor-critic models, and applications of deep learning to RL; the course will also cover Exploration vs. Exploitation: epsilon-greedy strategy, Upper Confidence Bound (UCB), Thompson sampling, and exploration strategies; Advanced Topics in Reinforcement Learning: multi-agent systems, inverse reinforcement learning, continuous action spaces, and applications in robotics, game playing, and autonomous vehicles.

Accessibility

Reset All

Background Colour

Font Face

Font Kerning

Font Size

Image Visibility

Letter Spacing

Line Height

1.2

Link Highlight

Text Colour