update projects
This commit is contained in:
19
projects/codes/Sarsa/README.md
Normal file
19
projects/codes/Sarsa/README.md
Normal file
@@ -0,0 +1,19 @@
|
||||
# Sarsa
|
||||
|
||||
## 使用说明
|
||||
|
||||
运行```main.py```即可
|
||||
|
||||
## 环境说明
|
||||
|
||||
见[环境说明](https://github.com/JohnJim0816/reinforcement-learning-tutorials/blob/master/env_info.md)中的The Racetrack
|
||||
|
||||
## 算法伪代码
|
||||
|
||||

|
||||
|
||||
## 其他说明
|
||||
|
||||
### 与Q-learning区别
|
||||
|
||||
算法上区别很小,只在更新公式上,但Q-learning是Off-policy,而Sarsa是On-policy,可参考[知乎:强化学习中sarsa算法是不是比q-learning算法收敛速度更慢?](https://www.zhihu.com/question/268461866)
|
||||
Reference in New Issue
Block a user