Update chapter8_questions&keywords.md

This commit is contained in:
David Young
2021-02-07 22:35:37 +08:00
committed by GitHub
parent 517fbfb4d5
commit 2d116a5019

View File

@@ -1,6 +1,6 @@
# Chapter8 Q-learning for Continuous Actions
## 思考题
## Questions
- Q-learning相比于policy gradient based方法为什么训练起来效果更好更平稳