This directory contains a TensorFlow implementation of an Implicit Gradient Transport (IGT) optimizer using an anytime average. For details, see the paper *Reducing the variance in online optimization by transporting past gradients* (NeurIPS 2019).
The optimizer relies on gradient extrapolation: the gradient is not computed at the current parameter values. In this implementation, the TensorFlow variables hold the shifted parameters, while the true parameters are kept in associated optimizer slots. This is an important distinction, especially when considering whether a learning curve reflects the true or the shifted parameters.
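To make the shifted/true distinction concrete, here is a minimal NumPy sketch of an IGT-style update on a 1-D quadratic. The averaging weights `c = 1 / (t + 1)` and the matching extrapolation factor are one straightforward reading of the paper's full anytime average; the actual optimizer differs in details (tail fraction, TF slot mechanics), so treat this as an illustration rather than the repository's implementation.

```python
# Illustrative sketch only: full anytime average (c_t = 1 / (t + 1)),
# not the repository's optimizer, which also supports a tail fraction
# and stores the true parameters in TF slots.
import numpy as np

def grad(x):
    # Gradient of f(x) = 0.5 * x**2.
    return x

lr = 0.1
true_param = np.array(5.0)   # what the learning curves should track
prev_param = true_param.copy()
velocity = np.zeros_like(true_param)

for t in range(100):
    c = 1.0 / (t + 1)
    # Extrapolated ("shifted") point: where the gradient is evaluated,
    # and what the TF variables in this repo actually hold.
    shifted_param = true_param + ((1.0 - c) / c) * (true_param - prev_param)
    velocity = (1.0 - c) * velocity + c * grad(shifted_param)
    prev_param, true_param = true_param, true_param - lr * velocity

print('true parameter after training:', true_param)
```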
The experimental framework is centered on a fork of the Cloud TPU resnet code from May 2019.
`resnet_main.py` is the main executable. Important flags are:

*   `mode`, which offers a special `eval_igt` mode for evaluating an IGT model
    at the true parameters (vs the shifted ones). This value should be used in
    conjunction with the `igt_eval_mode` and `igt_eval_set` flags.
*   `optimizer`, for setting the optimizer.
*   `igt_optimizer`, for setting the optimizer to use in conjunction with IGT.
*   `tail_fraction`, for setting IGT's anytime average data window.
*   `lr_decay` and `lr_decay_step_fraction`, for configuring the learning rate
    decay schedule.
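For illustration, an evaluation run at the true parameters might be launched as follows. Only the flag names and the `eval_igt` mode come from this README; the flag values and the bucket paths are hypothetical placeholders, not values documented by the code.

```shell
# Hypothetical invocation: flag values and paths are placeholders.
python resnet_main.py \
  --mode=eval_igt \
  --igt_eval_mode=true_param \
  --igt_eval_set=test \
  --optimizer=igt \
  --igt_optimizer=momentum \
  --tail_fraction=0.5 \
  --data_dir=gs://your-bucket/imagenet \
  --model_dir=gs://your-bucket/igt_model
```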
`dump_metrics_to_csv.py` converts the learning curves from their TensorFlow
summary format to an easier-to-consume CSV format.
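As a rough sketch of what such a conversion involves (this is not `dump_metrics_to_csv.py`'s actual interface), scalar summaries can be read from an event file and written out as CSV along these lines, assuming TF 1.x-style event files:

```python
# Sketch of summary-to-CSV conversion; not the script's real interface.
import csv
import tensorflow as tf

def events_to_csv(events_path, csv_path):
    # Collect (step, tag, value) triples for every scalar summary.
    rows = []
    for event in tf.compat.v1.train.summary_iterator(events_path):
        for value in event.summary.value:
            if value.HasField('simple_value'):
                rows.append((event.step, value.tag, value.simple_value))
    with open(csv_path, 'w', newline='') as f:
        writer = csv.writer(f)
        writer.writerow(['step', 'tag', 'value'])
        writer.writerows(rows)

# Hypothetical file names:
# events_to_csv('events.out.tfevents.1234', 'learning_curves.csv')
```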
If you use this code in a publication, please cite the original paper:

@inproceedings{arnold2019reducing,
  title = {Reducing the variance in online optimization by transporting past
           gradients},
  author = {Sébastien Arnold and Pierre-Antoine Manzagol and Reza Harikandeh
            and Ioannis Mitliagkas and Nicolas Le Roux},
  booktitle = {NeurIPS},
  year = {2019}
}