Array ( [0] => Array ( [title] => L17A [link] => https://www.youtube.com/embed/RHDBqwlSGVU ) [1] => Array ( [title] => L17B [link] => https://www.youtube.com/embed/Pw4Wfc6_I3Y ) [2] => Array ( [title] => L17C [link] => https://www.youtube.com/embed/LDgO53BbujI ) [3] => Array ( [title] => L17D [link] => https://www.youtube.com/embed/_kBJfCzDYT0 ) [4] => Array ( [title] => L17E [link] => https://www.youtube.com/embed/VgciEq2x4-Y ) [5] => Array ( [title] => L17F [link] => https://www.youtube.com/embed/F7b8Qa7b7TI ) [6] => Array ( [title] => L17G [link] => https://www.youtube.com/embed/0xaRGpsLmLs ) [7] => Array ( [title] => L17H [link] => https://www.youtube.com/embed/iqspZa_S8Fs ) [8] => Array ( [title] => L17I [link] => https://www.youtube.com/embed/_bS16s6iY3I ) [9] => Array ( [title] => L17J [link] => https://www.youtube.com/embed/3fVzlwqeQrY ) ) 國立清華大學開放式課程OpenCourseWare(NTHU, OCW) - 第17講 Deep Reinforcement Learning/ DQN & Policy Network

Title

第17講 Deep Reinforcement Learning/ DQN & Policy Network

第1節

L17A

第2節

L17B

第3節

L17C

第4節

L17D

第5節

L17E

第6節

L17F

第7節

L17G

第8節

L17H

第9節

L17I

第10節

L17J

Syllabus

章節大綱

L17A
        Introduction
 
L17B
        Deep Q-Network (DQN)
 
L17C
        Double DQN
 
L17D
       Deep Reinforcement Learning/ DQN & Policy Network 
L17E
        Dueling Network
 
L17F
        NoisyNet and Scalable Implementations (e.g.
        Google Gorila)
 
L17G
        Policy Gradient Methods & DDPG
 
L17H
        Episodic Policy Gradient & REINFORCE
 
L17I
        Reducing Variance
 
L17J
        Baseline Subtraction