Update chapter8_questions&keywords.md

2021-02-07 22:35:37 +08:00
parent 517fbfb4d5
commit 2d116a5019
1 changed files with 2 additions and 2 deletions
@@ -1,6 +1,6 @@
 # Chapter8 Q-learning for Continuous Actions

-## 思考题
+## Questions

 - Q-learning相比于policy gradient based方法为什么训练起来效果更好，更平稳？