National Tsing Hua University OpenCourseWare

Engineering
Deep Learning: Shan-Hung Wu (吳尚鴻)


Lecture 16: Reinforcement Learning/Q-learning

L16A
Introduction

L16B
Markov Decision Process (MDP)

L16C
Value Iteration

L16D
Policy Iteration

L16E
Reinforcement Learning

L16F
Model-Free RL based on MC Estimation

L16G
Temporal Difference Learning SARSA

L16H
Exploration Strategies

L16I
Q-Learning

L16J
SARSA vs. Q-Learning


Related links

Lecture 14: Unsupervised Learning/Autoencoders/GANs

Lecture 15: Semisupervised/Transfer Learning and the Future

Lecture 17: Deep Reinforcement Learning/DQN & Policy Network