update
BIN
docs/chapter12/assets/image-20201015221602396.png
Normal file
|
After Width: | Height: | Size: 535 KiB |
BIN
docs/chapter12/assets/moving_average_rewards_eval.png
Normal file
|
After Width: | Height: | Size: 50 KiB |
BIN
docs/chapter12/assets/moving_average_rewards_train.png
Normal file
|
After Width: | Height: | Size: 40 KiB |
BIN
docs/chapter12/assets/rewards_eval.png
Normal file
|
After Width: | Height: | Size: 74 KiB |
BIN
docs/chapter12/assets/rewards_train.png
Normal file
|
After Width: | Height: | Size: 56 KiB |
BIN
docs/chapter12/assets/steps_eval.png
Normal file
|
After Width: | Height: | Size: 23 KiB |
BIN
docs/chapter12/assets/steps_train.png
Normal file
|
After Width: | Height: | Size: 23 KiB |
@@ -53,7 +53,23 @@ for i_episode in range(1, cfg.max_episodes+1): # cfg.max_episodes为最大训练
|
||||
|
||||
训练并绘制reward以及滑动平均后的reward随epiosde的变化曲线图并记录超参数写成报告,图示如下:
|
||||
|
||||

|
||||

|
||||
|
||||

|
||||
|
||||

|
||||
|
||||
同时也可以绘制测试(eval)模型时的曲线:
|
||||
|
||||

|
||||
|
||||

|
||||
|
||||

|
||||
|
||||
也可以[tensorboard](https://pytorch.org/docs/stable/tensorboard.html)查看结果,如下:
|
||||
|
||||

|
||||
|
||||
### 注意
|
||||
|
||||
|
||||