Commit Graph

915 Commits

Author SHA1 Message Date
Qi Wang
fceb23c1de Merge pull request #162 from goodmorning-hwt/a-small-typo-in-chap4
udpate chap4
2024-09-08 22:52:56 +08:00
qiwang
6d3975008f Merge branch 'master' of github.com:datawhalechina/easy-rl 2024-09-08 13:45:05 +08:00
qiwang
55fd6efe48 update 2024-09-08 13:44:50 +08:00
He Wentao
b8251ada61 udpate chap4
fix a small typo in chap4.
4.2.1中"但是实际上我们是在做采样本来这边应该是一个期望...."
我想应该是缺少了一个句号。“但是实际上我们是在做采样。本来...”
(刚好看到就随手提交了)
2024-07-25 20:21:13 +08:00
Qi Wang
fcd839a3bb Update README.md 2024-07-22 18:55:16 +08:00
qiwang
f4bf1430d9 udpate img 2024-06-24 13:13:49 +08:00
qiwang
7a0811b55c Merge branch 'master' of github.com:datawhalechina/easy-rl 2024-06-24 13:12:50 +08:00
qiwang
32516ee106 udpate ch2 2024-06-24 13:12:34 +08:00
Yiyuan Yang
479fb6dc6b Update chapter1_questions&keywords.md 2024-06-20 11:33:06 +01:00
Yiyuan Yang
1c154588d7 Update chapter1_questions&keywords.md 2024-06-20 11:30:54 +01:00
qiwang
262664c1fe Merge branch 'master' of github.com:datawhalechina/easy-rl 2024-06-18 19:51:28 +08:00
qiwang
b44a51aa36 update errata 2024-06-18 19:51:03 +08:00
Qi Wang
815bbd81d4 Merge pull request #159 from GorgeousWang/patch-2
Update chapter1.md
2024-06-18 19:41:58 +08:00
Yiyuan Yang
c623f58f73 Update README.md 2024-06-18 08:09:10 +08:00
fuyuwang
929f0cafad Update chapter1.md
步数>200并不代表游戏的输赢,容易产生误解。>200属于游戏截断操作。参考:https://www.gymlibrary.dev/environments/classic_control/cart_pole/
2024-06-17 17:21:28 +08:00
Qi Wang
5e07eb89f8 delete spaces 2024-06-17 13:46:42 +08:00
qiwang067
b6f7133169 update ch1.md 2024-06-16 20:06:51 +08:00
qiwang067
75ffda1954 update errata 2024-06-14 01:35:51 +08:00
qiwang067
00e91b53a2 update ch14 2024-06-09 21:38:38 +08:00
qiwang067
a7cdf0e8d2 update paper list 2024-06-09 21:36:34 +08:00
Qi Wang
7168b2021a Update README.md 2024-06-09 21:21:46 +08:00
Qi Wang
293213eb49 Update README.md 2024-06-02 10:31:02 +08:00
Qi Wang
fd844fb786 Update README.md 2024-06-02 10:22:29 +08:00
Qi Wang
bd25553a80 Merge pull request #145 from ssccinng/patch-1
[fix] 括号不匹配
2024-03-19 11:05:28 +08:00
Qi Wang
62152c6dd2 Update README.md 2024-03-10 01:35:23 +08:00
qiwang067
6aad94dc83 update readme 2024-03-07 20:32:38 +08:00
qiwang067
5c9b58880d update typos 2024-02-23 16:58:16 +08:00
qiwang067
a877519952 update readme 2024-02-05 10:08:54 +08:00
qiwang067
c7a7577766 update errata 2024-02-04 22:10:55 +08:00
qiwang067
7a49e69d00 update readme 2024-02-04 16:52:48 +08:00
qiwang067
74ef34158d update readme 2024-02-01 21:53:04 +08:00
qiwang067
41693f51cd update errata 2024-01-30 11:35:33 +08:00
qiwang067
ebcb4adb6f update readme 2024-01-27 23:22:37 +08:00
qiwang067
9a578d4221 update ch1.md 2024-01-16 21:40:41 +08:00
qiwang067
d48754c21d update errata 2023-12-13 02:57:31 +08:00
qiwang067
82fde031fb Merge branch 'master' of github.com:datawhalechina/easy-rl 2023-11-24 16:10:31 +08:00
qiwang067
c325345f81 update errata 2023-11-24 16:00:46 +08:00
Junhui Tao
4d3799efdf Update chapter10.md 修改黄金状态(gold state)为目标状态(goal state)
在李宏毅稀疏奖励那一节课中,逆强化学习的那个最终目标,称为目标状态而不是黄金状态
2023-11-19 16:37:27 +08:00
ssccinng
5577979540 [fix] 括号不匹配 2023-10-15 14:54:52 +08:00
qiwang067
d86741fbbd update errata 2023-10-14 21:14:24 +08:00
qiwang067
19fbe26846 update errata 2023-10-14 21:06:47 +08:00
qiwang067
13cd10e211 update 6-7 2023-10-14 20:58:43 +08:00
qiwang067
b7e9d7d880 update errata 2023-10-14 20:56:43 +08:00
qiwang067
293526d5b1 update ch1.md 2023-10-08 12:50:28 +08:00
qiwang067
8a67022041 update ch1.md 2023-10-08 12:38:12 +08:00
qiwang067
90f8b0ce71 update README.md 2023-07-25 18:24:15 +08:00
qiwang067
cc61269177 update errata.md 2023-07-25 16:59:16 +08:00
qiwang067
e164cf27a6 update ch1.md 2023-07-25 16:50:47 +08:00
qiwang067
5d248d68b7 update ch1.md 2023-07-25 16:39:47 +08:00
qiwang067
297153d376 update errata 2023-07-22 00:57:05 +08:00