From ea65168834f8e9b97fe7d9e32f92fc2ab2727e28 Mon Sep 17 00:00:00 2001 From: qiwang067 Date: Fri, 8 Nov 2024 15:43:20 +0800 Subject: [PATCH 1/3] update errata --- docs/errata.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/docs/errata.md b/docs/errata.md index 6fe2d01..767d75e 100644 --- a/docs/errata.md +++ b/docs/errata.md @@ -23,6 +23,10 @@ $$ * 140页,第一段最前面加上:本章介绍基于价值的典型强化学习算法——**深度Q网络(deep Q-network,DQN)**。 * 140页,第三段第一行:深度 Q 网络(deep Q-network,DQN)→ 深度 Q 网络。 +* 165页,第一段第2行:归一化(normalization)。归一化的过程 → 零均值化。零均值化的过程 +* 165页,第一段第4行:归一化 → 零均值化 +* 165页,第二段第2行:归一化 → 零均值化 +* 165页,第二段第3行:归一化 → 零均值化 ## 第1版第8次印刷(2023.11) From 2243904083999f2b4844a32a0cb0a642f4b2b87a Mon Sep 17 00:00:00 2001 From: Qi Wang Date: Sun, 19 Jan 2025 16:15:23 +0800 Subject: [PATCH 2/3] Update README.md --- docs/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/README.md b/docs/README.md index a3f5eb5..8e0bd3c 100644 --- a/docs/README.md +++ b/docs/README.md @@ -18,7 +18,7 @@ -推荐购买链接:[京东](https://u.jd.com/tG2sxLb) | [当当](http://product.dangdang.com/29374163.html) +推荐购买链接:[京东](https://item.jd.com/13075567.html) | [当当](http://product.dangdang.com/29374163.html) From f23cda6502f920c7fb75f1f8c003752a544c5c70 Mon Sep 17 00:00:00 2001 From: Qi Wang Date: Sat, 1 Feb 2025 09:47:53 +0800 Subject: [PATCH 3/3] Update README.md --- docs/README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/docs/README.md b/docs/README.md index 8e0bd3c..075a912 100644 --- a/docs/README.md +++ b/docs/README.md @@ -90,6 +90,7 @@ PDF版本是全书初稿,人民邮电出版社的编辑老师们对初稿进 [点击](https://github.com/datawhalechina/easy-rl/tree/master/papers)或者网页点击```papers```文件夹进入经典强化学习论文解读 ## 扩展资源 +- 对**强化学习玩我的世界(Minecraft)游戏**感兴趣的读者,可阅读 [LS-Imagine](https://github.com/qiwang067/LS-Imagine) - 对**视觉强化学习**感兴趣的读者,可阅读[Awesome Visual RL](https://github.com/qiwang067/awesome-visual-rl) - 对**深度学习**感兴趣的读者,可阅读[李宏毅深度学习教程LeeDL-Tutorial](https://github.com/datawhalechina/leedl-tutorial)