Update algorithm templates
7
projects/codes/A2C/README.md
Normal file
@@ -0,0 +1,7 @@
## Script Description
* `task0.py`: discrete-action task
* `task1.py`: discrete-action task; the only difference from `task0.py` is that the Actor uses tanh instead of relu as its activation function, which works better on `CartPole-v1`
* `task2.py`: continuous-action task (TODO: still needs debugging)
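
The relu/tanh difference between `task0.py` and `task1.py` can be illustrated with a minimal sketch. This is not the code in those scripts: it is a plain-Python, standard-library-only forward pass, and the names `make_actor`, `actor_forward`, the hidden size of 8, and the random weights are all hypothetical. The 4-dimensional state and 2 actions match `CartPole-v1`.

```python
import math
import random


def relu(x):
    """Rectified linear unit: zero for negative inputs."""
    return max(0.0, x)


# Hypothetical registry so the activation can be switched by name.
ACTIVATIONS = {"relu": relu, "tanh": math.tanh}


def make_actor(state_dim, hidden_dim, n_actions, seed=0):
    """Randomly initialise a one-hidden-layer actor (illustrative weights, no biases)."""
    rng = random.Random(seed)

    def layer(n_in, n_out):
        return [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]

    return {"w1": layer(state_dim, hidden_dim), "w2": layer(hidden_dim, n_actions)}


def actor_forward(params, state, activation="relu"):
    """Return action probabilities for a discrete-action actor."""
    act = ACTIVATIONS[activation]
    # Hidden layer: linear map followed by the chosen activation.
    hidden = [act(sum(w * s for w, s in zip(row, state))) for row in params["w1"]]
    # Output layer: one logit per action.
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in params["w2"]]
    # Numerically stable softmax over the logits.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# CartPole-v1 has a 4-dimensional observation and 2 discrete actions.
params = make_actor(state_dim=4, hidden_dim=8, n_actions=2)
state = [0.02, -0.01, 0.03, 0.04]
probs_relu = actor_forward(params, state, "relu")
probs_tanh = actor_forward(params, state, "tanh")
```

With the same weights, only the hidden activation changes between the two calls, which mirrors the single difference described for `task1.py`.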