Reinforcement Learning for Intelligent Decision Making

Instructor: Jiadong Yu

Term: Fall

Location: HKUST(GZ)

Time: 2024/2025 Fall

Course Overview

This MPhil/PhD course systematically covers reinforcement learning from fundamental concepts to advanced methods, with an emphasis on practical implementation and research-oriented understanding.

Multi-armed bandits and decision-making foundations
Markov decision processes and value functions
Model-free and model-based reinforcement learning
Deep reinforcement learning
Advanced RL and multi-agent reinforcement learning

Offering

IOTA 5201
Semester: 2024/2025 Fall
Level: MPhil/PhD

Topics

Multi-armed bandits and sequential decision-making
Monte Carlo, temporal difference learning, Q-learning, and SARSA
Policy gradient and actor-critic methods
Deep Q-learning, PPO, TRPO, and DDPG
RLHF and multi-agent reinforcement learning
Practical coding and application-driven RL study

Teaching Focus

The course moves from RL foundations to advanced RL and MARL, helping postgraduate students connect algorithmic understanding with implementation and research applications.