update README

2021-03-23 16:07:30 +08:00
parent 1c7339e08b
commit d4690c2058
2 changed files with 28 additions and 27 deletions
@@ -31,7 +31,7 @@
 ## 算法代码实现一览
 |                           算法名称                           |                        相关论文材料                         |                             备注                             | 进度 |
-| :----------------------------------------------------------: | :---------------------------------------------------------: | :--------------------------------: | :--: |
+| :----------------------------------------------------------: | :---------------------------------------------------------: | :----------------------------------------------------------: | :--: |
 | [On-Policy First-Visit MC](https://github.com/datawhalechina/easy-rl/tree/master/codes/MonteCarlo) |                                                             |                         蒙特卡洛算法                         |  OK  |
 | [Q-Learning](https://github.com/datawhalechina/easy-rl/tree/master/codes/QLearning) |                                                             |                                                              |  OK  |
 | [Sarsa](https://github.com/datawhalechina/easy-rl/tree/master/codes/Sarsa) |                                                             |                                                              |  OK  |
@@ -41,6 +41,7 @@
 |                       Hierarchical DQN                       |    [Hierarchical DQN](https://arxiv.org/abs/1604.06057)     |                                                              |      |
 | [PolicyGradient](https://github.com/datawhalechina/easy-rl/tree/master/codes/PolicyGradient) |                                                             |                                                              |  OK  |
 | [A2C](https://github.com/datawhalechina/easy-rl/tree/master/codes/A2C) |                                                             |                                                              |  OK  |
 | [PPO](https://github.com/datawhalechina/easy-rl/tree/master/codes/PPO) |        [PPO paper](https://arxiv.org/abs/1707.06347)        | [PPO算法实战](https://blog.csdn.net/JohnJim0/article/details/115126363) |  OK  |
 |                             DDPG                             |       [DDPG Paper](https://arxiv.org/abs/1509.02971)        |                                                              |  OK  |
 |                             TD3                              | [Twin Dueling DDPG Paper](https://arxiv.org/abs/1802.09477) |                                                              |      |
@@ -31,7 +31,7 @@
 ## 算法代码实现一览
 |                           算法名称                           |                        相关论文材料                         |                             备注                             | 进度 |
-| :----------------------------------------------------------: | :---------------------------------------------------------: | :--------------------------------: | :--: |
+| :----------------------------------------------------------: | :---------------------------------------------------------: | :----------------------------------------------------------: | :--: |
 | [On-Policy First-Visit MC](https://github.com/datawhalechina/easy-rl/tree/master/codes/MonteCarlo) |                                                             |                         蒙特卡洛算法                         |  OK  |
 | [Q-Learning](https://github.com/datawhalechina/easy-rl/tree/master/codes/QLearning) |                                                             |                                                              |  OK  |
 | [Sarsa](https://github.com/datawhalechina/easy-rl/tree/master/codes/Sarsa) |                                                             |                                                              |  OK  |
@@ -41,10 +41,10 @@
 |                       Hierarchical DQN                       |    [Hierarchical DQN](https://arxiv.org/abs/1604.06057)     |                                                              |      |
 | [PolicyGradient](https://github.com/datawhalechina/easy-rl/tree/master/codes/PolicyGradient) |                                                             |                                                              |  OK  |
 | [A2C](https://github.com/datawhalechina/easy-rl/tree/master/codes/A2C) |                                                             |                                                              |  OK  |
 | [PPO](https://github.com/datawhalechina/easy-rl/tree/master/codes/PPO) |        [PPO paper](https://arxiv.org/abs/1707.06347)        | [PPO算法实战](https://blog.csdn.net/JohnJim0/article/details/115126363) |  OK  |
 |                             DDPG                             |       [DDPG Paper](https://arxiv.org/abs/1509.02971)        |                                                              |  OK  |
 |                             TD3                              | [Twin Dueling DDPG Paper](https://arxiv.org/abs/1802.09477) |                                                              |      |
 ## 贡献者
 <table border="0">