[Bug] ModelTrainer.maybe_get_best_weights() does not deal properly with negative evaluation scores #102

jan1854 · 2021-07-07T15:14:55Z

Line 220 in 621832f

improvement = (best_val_score - val_score) / best_val_score

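The above calculation of the relative improvement of the evaluation score in ModelTrainer seems to be wrong for negative evaluation scores: when best_val_score is negative, dividing by it flips the sign of the result, so an improvement is reported as a deterioration. This can be fixed by wrapping the divisor in torch.abs().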

Steps to reproduce

import torch
from mbrl.models import ModelTrainer, GaussianMLP

dummy = GaussianMLP(1, 1, "cpu")
model_trainer = ModelTrainer(dummy)
previous_eval_value = torch.tensor(-1.0)
current_eval_value = torch.tensor(-10.0)
print(model_trainer.maybe_get_best_weights(previous_eval_value, current_eval_value))

Observed Results

model_trainer.maybe_get_best_weights() returns None, which indicates that the evaluation value did not improve from previous_eval_value to current_eval_value.

Expected Results

The relative improvement from previous_eval_value to current_eval_value is 900%. Thus, model_trainer.maybe_get_best_weights() should return the parameters of the model, which would indicate that the evaluation value improved.
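The arithmetic can be checked without the library. The following is a minimal sketch that uses plain Python floats instead of tensors; both function names are hypothetical, and the built-in abs() stands in for the proposed torch.abs(). Lower scores are treated as better, as in the line quoted above.

```python
def relative_improvement_original(best_val_score: float, val_score: float) -> float:
    # Same expression as the quoted line: the divisor keeps its sign.
    return (best_val_score - val_score) / best_val_score


def relative_improvement_abs(best_val_score: float, val_score: float) -> float:
    # Proposed variant: divide by the magnitude of the best score.
    return (best_val_score - val_score) / abs(best_val_score)


previous_eval_value = -1.0
current_eval_value = -10.0

original = relative_improvement_original(previous_eval_value, current_eval_value)
fixed = relative_improvement_abs(previous_eval_value, current_eval_value)

print(original)  # -9.0, reads as a 900% deterioration
print(fixed)     # 9.0, reads as a 900% improvement
```

For positive scores the two expressions agree, so the change only affects the negative case.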


jan1854 · 2021-07-07T15:22:17Z

While we are on the topic of ModelTrainer, it would be nice if the threshold for improvement could be specified when calling ModelTrainer.train(). Right now threshold is a parameter of ModelTrainer.maybe_get_best_weights(), but not of ModelTrainer.train(). Since different applications deal with different scales of evaluation scores (and relative improvements of these scores), it would be nice to have a bit more flexibility here.
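To illustrate the request, here is a rough standalone sketch of a training loop that forwards a threshold to the improvement check. It is not the ModelTrainer code: the functions improved() and train(), the parameter name improvement_threshold, and the default of 0.01 are all assumptions made for the example.

```python
from typing import List


def improved(best: float, current: float, threshold: float) -> bool:
    # Hypothetical check: lower is better, improvement measured relative to |best|.
    # Assumes best != 0; a real implementation would need to handle that case.
    return (best - current) / abs(best) > threshold


def train(val_scores: List[float], improvement_threshold: float = 0.01) -> float:
    # Toy loop: walks over precomputed validation scores and keeps the best one,
    # accepting a new score only if it clears the caller-supplied threshold.
    best = val_scores[0]
    for score in val_scores[1:]:
        if improved(best, score, improvement_threshold):
            best = score
    return best


scores = [-1.0, -1.005, -1.2]
print(train(scores))                             # -1.2
print(train(scores, improvement_threshold=0.5))  # -1.0
```

With the stricter threshold, neither later score counts as an improvement, which is the kind of per-application control that is missing when the threshold cannot be set from train().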

luisenp · 2021-07-07T20:25:15Z

Good point, I never considered this particular case. Do you want to submit a pull request? You've been reporting bugs/fixes for a while, might as well get some contribution credit :)

jan1854 added the "bug" label Jul 7, 2021

jan1854 mentioned this issue Jul 8, 2021

Bugfix in ModelTrainer.maybe_get_best_weights() and made the threshold for improvement configurable from ModelTrainer.train() #104

Merged

7 tasks

luisenp closed this as completed in #104 Jul 8, 2021
