更新算法模版

This commit is contained in:
johnjim0816
2022-11-06 12:15:36 +08:00
parent 466a17707f
commit dc78698262
256 changed files with 17282 additions and 10229 deletions

View File

@@ -5,7 +5,7 @@ Author: John
Email: johnjim0816@gmail.com
Date: 2020-11-22 23:27:44
LastEditor: John
LastEditTime: 2022-08-27 13:45:26
LastEditTime: 2022-10-09 21:28:18
Discription:
Environment:
'''
@@ -31,8 +31,6 @@ class PolicyGradient:
state = torch.from_numpy(state).float()
state = Variable(state)
probs = self.policy_net(state)
print("probs")
print(probs)
m = Bernoulli(probs) # 伯努利分布
action = m.sample()