fuyuwang
|
929f0cafad
|
Update chapter1.md
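A step count above 200 does not indicate whether the game was won or lost, which is easy to misread. Going past 200 steps is an episode truncation. Reference: https://www.gymlibrary.dev/environments/classic_control/cart_pole/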
|
2024-06-17 17:21:28 +08:00 |
|
Qi Wang
|
5e07eb89f8
|
delete spaces
|
2024-06-17 13:46:42 +08:00 |
|
qiwang067
|
b6f7133169
|
update ch1.md
|
2024-06-16 20:06:51 +08:00 |
|
qiwang067
|
75ffda1954
|
update errata
|
2024-06-14 01:35:51 +08:00 |
|
qiwang067
|
00e91b53a2
|
update ch14
|
2024-06-09 21:38:38 +08:00 |
|
qiwang067
|
a7cdf0e8d2
|
update paper list
|
2024-06-09 21:36:34 +08:00 |
|
Qi Wang
|
7168b2021a
|
Update README.md
|
2024-06-09 21:21:46 +08:00 |
|
Qi Wang
|
293213eb49
|
Update README.md
|
2024-06-02 10:31:02 +08:00 |
|
Qi Wang
|
fd844fb786
|
Update README.md
|
2024-06-02 10:22:29 +08:00 |
|
Qi Wang
|
7283499d00
|
Add files via upload
|
2024-06-01 22:09:30 +08:00 |
|
Qi Wang
|
a5e2cca150
|
update Q-learning
|
2024-06-01 22:08:31 +08:00 |
|
Qi Wang
|
bd25553a80
|
Merge pull request #145 from ssccinng/patch-1
[fix] Mismatched parentheses
|
2024-03-19 11:05:28 +08:00 |
|
Qi Wang
|
62152c6dd2
|
Update README.md
|
2024-03-10 01:35:23 +08:00 |
|
qiwang067
|
6aad94dc83
|
update readme
|
2024-03-07 20:32:38 +08:00 |
|
qiwang067
|
5c9b58880d
|
update typos
|
2024-02-23 16:58:16 +08:00 |
|
qiwang067
|
a877519952
|
update readme
|
2024-02-05 10:08:54 +08:00 |
|
qiwang067
|
c7a7577766
|
update errata
|
2024-02-04 22:10:55 +08:00 |
|
qiwang067
|
7a49e69d00
|
update readme
|
2024-02-04 16:52:48 +08:00 |
|
qiwang067
|
74ef34158d
|
update readme
|
2024-02-01 21:53:04 +08:00 |
|
qiwang067
|
41693f51cd
|
update errata
|
2024-01-30 11:35:33 +08:00 |
|
qiwang067
|
ebcb4adb6f
|
update readme
|
2024-01-27 23:22:37 +08:00 |
|
qiwang067
|
9a578d4221
|
update ch1.md
|
2024-01-16 21:40:41 +08:00 |
|
qiwang067
|
d48754c21d
|
update errata
|
2023-12-13 02:57:31 +08:00 |
|
qiwang067
|
82fde031fb
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2023-11-24 16:10:31 +08:00 |
|
qiwang067
|
c325345f81
|
update errata
|
2023-11-24 16:00:46 +08:00 |
|
Qi Wang
|
5682246cb6
|
Merge pull request #148 from taojunhui/patch-1
Update chapter10.md: change "gold state" (黄金状态) to "goal state" (目标状态)
|
2023-11-19 16:48:43 +08:00 |
|
Junhui Tao
|
4d3799efdf
|
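Update chapter10.md: change "gold state" (黄金状态) to "goal state" (目标状态)
In Hung-yi Lee's lecture on sparse rewards, the final target in inverse reinforcement learning is called the goal state, not the gold state.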
|
2023-11-19 16:37:27 +08:00 |
|
ssccinng
|
5577979540
|
[fix] Mismatched parentheses
|
2023-10-15 14:54:52 +08:00 |
|
qiwang067
|
d86741fbbd
|
update errata
|
2023-10-14 21:14:24 +08:00 |
|
qiwang067
|
19fbe26846
|
update errata
|
2023-10-14 21:06:47 +08:00 |
|
qiwang067
|
13cd10e211
|
update 6-7
|
2023-10-14 20:58:43 +08:00 |
|
qiwang067
|
b7e9d7d880
|
update errata
|
2023-10-14 20:56:43 +08:00 |
|
qiwang067
|
293526d5b1
|
update ch1.md
|
2023-10-08 12:50:28 +08:00 |
|
qiwang067
|
8a67022041
|
update ch1.md
|
2023-10-08 12:38:12 +08:00 |
|
qiwang067
|
90f8b0ce71
|
update README.md
|
2023-07-25 18:24:15 +08:00 |
|
qiwang067
|
cc61269177
|
update errata.md
|
2023-07-25 16:59:16 +08:00 |
|
qiwang067
|
e164cf27a6
|
update ch1.md
|
2023-07-25 16:50:47 +08:00 |
|
qiwang067
|
5d248d68b7
|
update ch1.md
|
2023-07-25 16:39:47 +08:00 |
|
qiwang067
|
297153d376
|
update errata
|
2023-07-22 00:57:05 +08:00 |
|
qiwang067
|
f6dacf2f53
|
update ch1.md
|
2023-07-22 00:51:54 +08:00 |
|
qiwang067
|
60e7bf694c
|
update errata
|
2023-07-21 23:51:46 +08:00 |
|
qiwang067
|
df0673e12c
|
update ch1
|
2023-07-21 23:40:38 +08:00 |
|
qiwang067
|
e1462e7b2b
|
update errata
|
2023-07-21 23:36:50 +08:00 |
|
qiwang067
|
385d504eb2
|
update chapter1.md
|
2023-07-21 23:35:50 +08:00 |
|
qiwang067
|
e009758c36
|
update chapter1.md
|
2023-07-21 22:52:26 +08:00 |
|
qiwang067
|
385469de70
|
update RL_example.py
|
2023-07-21 22:50:02 +08:00 |
|
qiwang067
|
e4fb1ba4fb
|
update chapter1.md
|
2023-07-21 22:38:26 +08:00 |
|
qiwang067
|
64e99353a0
|
update RL_example.py
|
2023-07-21 22:36:46 +08:00 |
|
qiwang067
|
270e89d5b9
|
update errata
|
2023-07-21 22:09:18 +08:00 |
|
qiwang067
|
abb87a51ea
|
update RL_example.py
|
2023-07-21 21:54:22 +08:00 |
|