03dedd6cc11b5d93f623aeb1523d54e2f7bb253b
easy-rl/docs/_sidebar.md
qiwang067
03dedd6cc1
change contents
2020-07-04 15:52:20 +08:00
445 B
Executable File
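Contents
P1 Policy Gradient
P2 Proximal Policy Optimization (PPO)
P3 Q-Learning (Basic Concepts)
P4 Q-Learning (Advanced Tips)
P5 Q-Learning (Continuous Actions)
P6 Actor-Critic
P7 Sparse Reward
P8 Imitation Learning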