update errata

2022-10-23 20:45:03 +08:00
parent 62ba722f3e
commit ccada95ce2
1 changed files with 1 additions and 3 deletions
@@ -34,6 +34,7 @@
 ![](res/4-19.png ':size=550')
 * 127页，5.1节的标题：从同策略到异策略 → 重要性采样
 * 134页，式(5.16)下面一段第2行：最大化式 (5.16) → 最大化式 (5.15)
 * 165页，第一段的第4行到第5行：归一化的向量为 $[3,-1,2]^{\mathrm{T}}$ → 归一化的向量为 $[3,-1,-2]^{\mathrm{T}}$ 
 * 165页，第二段的第1行：向量 $[3,-1,2]^{\mathrm{T}}$ 中的每个元素 → 向量 $[3,-1,-2]^{\mathrm{T}}$ 中的每个元素 
@@ -41,9 +42,6 @@
 ![](res/9-4.png ':size=550')
 ## 第1版第2次印刷（2022.06）
 * 1页，图1.1删除参考文献：SUTTON R S, BARTO A G. Reinforcement learning: An introduction (second edition)[M]. London: The MIT Press, 2018