11 lines
269 B
Markdown
11 lines
269 B
Markdown
# *On-Policy First-Visit MC Control*
|
|
|
|
## 环境说明
|
|
|
|
见[环境说明](https://github.com/JohnJim0816/reinforcement-learning-tutorials/blob/master/env_info.md)中的The Racetrack
|
|
|
|
## First-Visit MC 介绍
|
|
|
|
### 伪代码
|
|
|
|
 |