From 8dcc35bf6a91b8e8529fc14c7f08d00c14fe3341 Mon Sep 17 00:00:00 2001
From: Yiyuan Yang
Date: Mon, 21 Nov 2022 19:09:00 +0800
Subject: [PATCH] Update readme.md

---
 papers/readme.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/papers/readme.md b/papers/readme.md
index 7754384..a9bfb9d 100644
--- a/papers/readme.md
+++ b/papers/readme.md
@@ -26,7 +26,7 @@
 | | Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor(**SAC**) [[Markdown格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/Soft%20Actor-Critic_Off-Policy%20Maximum%20Entropy%20Deep%20Reinforcement%20Learning%20with%20a%20Stochastic%20Actor.md) [[PDF格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/PDF/Soft%20Actor-Critic_Off-Policy%20Maximum%20Entropy%20Deep%20Reinforcement%20Learning%20with%20a%20Stochastic%20Actor.pdf) | https://arxiv.org/abs/1801.01290 | |
 | | Deterministic Policy Gradient Algorithms (**DPG**) [[Markdown格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/Deterministic%20Policy%20Gradient%20Algorithms.md) [[PDF格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/PDF/Deterministic%20Policy%20Gradient%20Algorithms.pdf) | http://proceedings.mlr.press/v32/silver14.pdf | |
 | | Continuous Control With Deep Reinforcement Learning (**DDPG**) | https://arxiv.org/abs/1509.02971 | |
-| | Addressing Function Approximation Error in Actor-Critic Methods (**TD3**) [[Markdown格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/Addressing%20Function%20Approximation%20Error%20in%20Actor-Critic%20Methods.md) | https://arxiv.org/abs/1802.09477 | |
+| | Addressing Function Approximation Error in Actor-Critic Methods (**TD3**) [[Markdown格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/Addressing%20Function%20Approximation%20Error%20in%20Actor-Critic%20Methods.md) [[PDF格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/PDF/Addressing%20Function%20Approximation%20Error%20in%20Actor-Critic%20Methods.pdf)| https://arxiv.org/abs/1802.09477 | |
 | | A Distributional Perspective on Reinforcement Learning (**C51**) [[Markdown格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/A%20Distributional%20Perspective%20on%20Reinforcement%20Learning.md) [[PDF格式]](https://github.com/datawhalechina/easy-rl/blob/master/papers/Policy_gradient/PDF/A%20Distributional%20Perspective%20on%20Reinforcement%20Learning.pdf) | https://arxiv.org/abs/1707.06887 | |
 | | | | |