Lecture 6: Q-Network by Sung Kim

Notice

Recent Posts

Tags more

Archives

관리 메뉴

JMANI

AI/Reinforcement Learning

jmani 2022. 5. 23. 17:50

Q-Network training: 상태를 입력으로 받아 가능한 모든 q-value 를 얻음

theta는 network의 weight

theta를 최소화 시키기 위해 학습

Markov Decision Process (0)	2022.06.27
Q-learning (0)	2022.06.24
Lecture 5: Q-learning on Nondeterministic Worlds! by Sung Kim (0)	2022.05.20
Lecture 4: Q-learning (table) exploit&exploration and discounted reward by Sung Kim (0)	2022.05.20
Lecture 3: Dummy Q-learning (table) by Sung Kim (0)	2022.05.19

'AI/Reinforcement Learning' Related Articles

Comments