diff --git a/papers/Policy_gradient/Addressing Function Approximation Error in Actor-Critic Methods.md b/papers/Policy_gradient/Addressing Function Approximation Error in Actor-Critic Methods.md index bc62b88..c18391e 100644 --- a/papers/Policy_gradient/Addressing Function Approximation Error in Actor-Critic Methods.md +++ b/papers/Policy_gradient/Addressing Function Approximation Error in Actor-Critic Methods.md @@ -134,7 +134,13 @@ $$ 3. 通过添加平滑噪声的方式优化方差计算时的峰值,解决由于估值函数对真实值拟合不精确带来的方差问题。 - +======== + +作者:王振凯 + +研究方向:深度学习、强化学习 + +河北地质大学研究生在读