Files
easy-rl/projects/codes/Sarsa/README.md
2022-07-31 23:42:12 +08:00

19 lines
534 B
Markdown
Raw Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Sarsa
## 使用说明
运行```main.py```即可
## 环境说明
见[环境说明](https://github.com/JohnJim0816/reinforcement-learning-tutorials/blob/master/env_info.md)中的The Racetrack
## 算法伪代码
![sarsa_algo](assets/sarsa_algo.png)
## 其他说明
### 与Q-learning区别
算法上区别很小只在更新公式上但Q-learning是Off-policy而Sarsa是On-policy可参考[知乎强化学习中sarsa算法是不是比q-learning算法收敛速度更慢](https://www.zhihu.com/question/268461866)