Skip to content

Commit

Permalink
Add reward histogram in parametric DQN
Browse files Browse the repository at this point in the history
Summary: Following D17536379, we add a reward histogram plot in parametric DQN workflows

Reviewed By: kittipatv

Differential Revision: D17682383

fbshipit-source-id: 5529ef07f411e77e938e008e84a3a225562c59f4
  • Loading branch information
czxttkl authored and facebook-github-bot committed Oct 9, 2019
1 parent 6d673a7 commit 755ba3a
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions ml/rl/training/parametric_dqn_trainer.py
Original file line number Diff line number Diff line change
Expand Up @@ -133,6 +133,7 @@ def train(self, training_batch) -> None:
self.loss_reporter.report(
td_loss=self.loss,
reward_loss=reward_loss,
logged_rewards=reward,
model_values_on_logged_actions=self.all_action_scores,
)

Expand Down

0 comments on commit 755ba3a

Please sign in to comment.