qiwang
|
54ab1e8bfa
|
update ch5
|
2025-02-09 12:54:49 +08:00 |
|
qiwang
|
c458b7fd3e
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2024-11-04 23:06:08 +08:00 |
|
qiwang
|
e40d62e346
|
update ch7
|
2024-11-04 23:05:50 +08:00 |
|
Qi Wang
|
7ab4c1ef42
|
Update chapter1.md
|
2024-10-22 14:06:11 +08:00 |
|
Qi Wang
|
10302d870e
|
Update README.md
|
2024-09-09 15:33:10 +08:00 |
|
Qi Wang
|
fceb23c1de
|
Merge pull request #162 from goodmorning-hwt/a-small-typo-in-chap4
udpate chap4
|
2024-09-08 22:52:56 +08:00 |
|
qiwang
|
6d3975008f
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2024-09-08 13:45:05 +08:00 |
|
qiwang
|
55fd6efe48
|
update
|
2024-09-08 13:44:50 +08:00 |
|
He Wentao
|
b8251ada61
|
udpate chap4
fix a small typo in chap4.
4.2.1中"但是实际上我们是在做采样本来这边应该是一个期望...."
我想应该是缺少了一个句号。“但是实际上我们是在做采样。本来...”
(刚好看到就随手提交了)
|
2024-07-25 20:21:13 +08:00 |
|
Qi Wang
|
fcd839a3bb
|
Update README.md
|
2024-07-22 18:55:16 +08:00 |
|
qiwang
|
f4bf1430d9
|
udpate img
|
2024-06-24 13:13:49 +08:00 |
|
qiwang
|
7a0811b55c
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2024-06-24 13:12:50 +08:00 |
|
qiwang
|
32516ee106
|
udpate ch2
|
2024-06-24 13:12:34 +08:00 |
|
Yiyuan Yang
|
479fb6dc6b
|
Update chapter1_questions&keywords.md
|
2024-06-20 11:33:06 +01:00 |
|
Yiyuan Yang
|
1c154588d7
|
Update chapter1_questions&keywords.md
|
2024-06-20 11:30:54 +01:00 |
|
qiwang
|
262664c1fe
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2024-06-18 19:51:28 +08:00 |
|
qiwang
|
b44a51aa36
|
update errata
|
2024-06-18 19:51:03 +08:00 |
|
Qi Wang
|
815bbd81d4
|
Merge pull request #159 from GorgeousWang/patch-2
Update chapter1.md
|
2024-06-18 19:41:58 +08:00 |
|
Yiyuan Yang
|
c623f58f73
|
Update README.md
|
2024-06-18 08:09:10 +08:00 |
|
fuyuwang
|
929f0cafad
|
Update chapter1.md
步数>200并不代表游戏的输赢,容易产生误解。>200属于游戏截断操作。参考:https://www.gymlibrary.dev/environments/classic_control/cart_pole/
|
2024-06-17 17:21:28 +08:00 |
|
Qi Wang
|
5e07eb89f8
|
delete spaces
|
2024-06-17 13:46:42 +08:00 |
|
qiwang067
|
b6f7133169
|
update ch1.md
|
2024-06-16 20:06:51 +08:00 |
|
qiwang067
|
75ffda1954
|
update errata
|
2024-06-14 01:35:51 +08:00 |
|
qiwang067
|
00e91b53a2
|
update ch14
|
2024-06-09 21:38:38 +08:00 |
|
qiwang067
|
a7cdf0e8d2
|
update paper list
|
2024-06-09 21:36:34 +08:00 |
|
Qi Wang
|
7168b2021a
|
Update README.md
|
2024-06-09 21:21:46 +08:00 |
|
Qi Wang
|
293213eb49
|
Update README.md
|
2024-06-02 10:31:02 +08:00 |
|
Qi Wang
|
fd844fb786
|
Update README.md
|
2024-06-02 10:22:29 +08:00 |
|
Qi Wang
|
7283499d00
|
Add files via upload
|
2024-06-01 22:09:30 +08:00 |
|
Qi Wang
|
a5e2cca150
|
update Q-learning
|
2024-06-01 22:08:31 +08:00 |
|
Qi Wang
|
bd25553a80
|
Merge pull request #145 from ssccinng/patch-1
[fix] 括号不匹配
|
2024-03-19 11:05:28 +08:00 |
|
Qi Wang
|
62152c6dd2
|
Update README.md
|
2024-03-10 01:35:23 +08:00 |
|
qiwang067
|
6aad94dc83
|
update readme
|
2024-03-07 20:32:38 +08:00 |
|
qiwang067
|
5c9b58880d
|
update typos
|
2024-02-23 16:58:16 +08:00 |
|
qiwang067
|
a877519952
|
update readme
|
2024-02-05 10:08:54 +08:00 |
|
qiwang067
|
c7a7577766
|
update errata
|
2024-02-04 22:10:55 +08:00 |
|
qiwang067
|
7a49e69d00
|
update readme
|
2024-02-04 16:52:48 +08:00 |
|
qiwang067
|
74ef34158d
|
update readme
|
2024-02-01 21:53:04 +08:00 |
|
qiwang067
|
41693f51cd
|
update errata
|
2024-01-30 11:35:33 +08:00 |
|
qiwang067
|
ebcb4adb6f
|
update readme
|
2024-01-27 23:22:37 +08:00 |
|
qiwang067
|
9a578d4221
|
update ch1.md
|
2024-01-16 21:40:41 +08:00 |
|
qiwang067
|
d48754c21d
|
update errata
|
2023-12-13 02:57:31 +08:00 |
|
qiwang067
|
82fde031fb
|
Merge branch 'master' of github.com:datawhalechina/easy-rl
|
2023-11-24 16:10:31 +08:00 |
|
qiwang067
|
c325345f81
|
update errata
|
2023-11-24 16:00:46 +08:00 |
|
Qi Wang
|
5682246cb6
|
Merge pull request #148 from taojunhui/patch-1
Update chapter10.md 修改黄金状态(gold state)为目标状态(goal state)
|
2023-11-19 16:48:43 +08:00 |
|
Junhui Tao
|
4d3799efdf
|
Update chapter10.md 修改黄金状态(gold state)为目标状态(goal state)
在李宏毅稀疏奖励那一节课中,逆强化学习的那个最终目标,称为目标状态而不是黄金状态
|
2023-11-19 16:37:27 +08:00 |
|
ssccinng
|
5577979540
|
[fix] 括号不匹配
|
2023-10-15 14:54:52 +08:00 |
|
qiwang067
|
d86741fbbd
|
update errata
|
2023-10-14 21:14:24 +08:00 |
|
qiwang067
|
19fbe26846
|
update errata
|
2023-10-14 21:06:47 +08:00 |
|
qiwang067
|
13cd10e211
|
update 6-7
|
2023-10-14 20:58:43 +08:00 |
|