Skip to content

Commit

Permalink
Fix missing learning rate and entropy coeff schedule for torch PPO (r…
Browse files Browse the repository at this point in the history
  • Loading branch information
janblumenkamp authored May 23, 2020
1 parent 2ab1b77 commit d6f78f5
Showing 1 changed file with 4 additions and 1 deletion.
5 changes: 4 additions & 1 deletion rllib/agents/ppo/ppo_torch_policy.py
Original file line number Diff line number Diff line change
Expand Up @@ -223,4 +223,7 @@ def setup_mixins(policy, obs_space, action_space, config):
extra_grad_process_fn=apply_grad_clipping,
before_init=setup_config,
after_init=setup_mixins,
mixins=[KLCoeffMixin, ValueNetworkMixin])
mixins=[
LearningRateSchedule, EntropyCoeffSchedule, KLCoeffMixin,
ValueNetworkMixin
])

0 comments on commit d6f78f5

Please sign in to comment.