task_specific_learned_opt

Partial code for "Understanding and correcting pathologies in the training of learned optimizers"

Authors: Luke Metz, Niru Maheswaranathan, Jeremy Nixon, C. Daniel Freeman, Jascha Sohl-Dickstein

Paper: https://arxiv.org/abs/1810.10180

What is in this folder

Due to the distributed nature of this project, much of the code is coupled to internal Google infrastructure, so releasing a fully running example is not possible at this time. This folder contains the core components of our paper (the learned optimizer architecture and the combined evolutionary strategies + gradient-based training algorithm), along with some stubbed-out code showing how the distributed training would work.

The architecture for the learned optimizer can be found in fast_rolling_mlp.py.
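
To make the shape of that architecture concrete, here is a minimal, hypothetical sketch of a per-parameter MLP optimizer in numpy. The class name, feature choices, and output scaling are all illustrative assumptions; the actual architecture lives in fast_rolling_mlp.py.

```python
# A minimal, hypothetical sketch of a per-parameter MLP learned optimizer.
# Names and feature choices here are illustrative assumptions, not the
# actual architecture in fast_rolling_mlp.py.
import numpy as np

class LearnedOptimizer:
    """Maps per-parameter features through a small MLP to produce updates."""

    def __init__(self, n_features=3, n_hidden=32, seed=0):
        rng = np.random.default_rng(seed)
        # Meta-parameters: trained by the outer loop, fixed during inner training.
        self.w1 = rng.normal(scale=0.1, size=(n_features, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.w2 = rng.normal(scale=0.1, size=(n_hidden, 1))
        self.b2 = np.zeros(1)

    def step(self, params, grads, momentum):
        # Per-parameter input features: gradient, momentum, current value.
        feats = np.stack([grads, momentum, params], axis=-1)
        hidden = np.tanh(feats @ self.w1 + self.b1)
        update = (hidden @ self.w2 + self.b2)[..., 0]
        # A small output scale keeps the inner training stable.
        return params - 0.01 * update
```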

The evolutionary strategies + reparameterization gradient trainer can be found in es_grad_inv_var.py.
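
As the file name suggests, the two gradient estimates are merged with inverse-variance weighting. Below is a minimal sketch of that merging step, assuming simple per-coordinate empirical variances; the actual estimator in es_grad_inv_var.py is more involved.

```python
# A minimal sketch of inverse-variance weighting of two gradient estimators,
# assuming simple per-coordinate empirical variances. This illustrates the
# idea, not the exact estimator in es_grad_inv_var.py.
import numpy as np

def inverse_variance_combine(rp_samples, es_samples, eps=1e-8):
    """Merge reparameterization and ES gradient samples per coordinate.

    Args:
      rp_samples: [n_samples, n_params] reparameterization gradient estimates.
      es_samples: [n_samples, n_params] evolutionary strategies estimates.
    Returns:
      [n_params] combined gradient estimate.
    """
    g_rp, var_rp = rp_samples.mean(axis=0), rp_samples.var(axis=0) + eps
    g_es, var_es = es_samples.mean(axis=0), es_samples.var(axis=0) + eps
    # Weight each estimator by its inverse empirical variance, so whichever
    # estimator is less noisy dominates on a per-coordinate basis.
    w_rp, w_es = 1.0 / var_rp, 1.0 / var_es
    return (w_rp * g_rp + w_es * g_es) / (w_rp + w_es)
```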

The cluster is started with run_chief.py, which runs the chief worker responsible for applying gradient updates to the learned optimizer. run_worker.py runs a worker; workers iteratively rebuild a training graph and push gradients to the parameter servers.
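
For intuition, here is a hypothetical, single-process stand-in for that chief/worker split, with a thread-safe queue playing the role of the parameter servers and random noise standing in for gradients computed from unrolled inner training. The real cluster code uses TensorFlow distributed machinery on internal infrastructure.

```python
# A hypothetical, single-process stand-in for the chief/worker split.
# A thread-safe queue plays the role of the parameter servers; the random
# "gradients" stand in for gradients computed from unrolled inner training.
import queue
import threading
import numpy as np

grad_queue = queue.Queue()   # workers push meta-gradients here
meta_params = np.zeros(10)   # learned-optimizer parameters updated by the chief

def worker(worker_id, n_steps=5):
    rng = np.random.default_rng(worker_id)
    for _ in range(n_steps):
        theta = meta_params.copy()           # pull current meta-parameters
        # Rebuild a training graph, unroll inner training, compute a gradient
        # (stubbed out here as noise with the right shape).
        grad = rng.normal(size=theta.shape)
        grad_queue.put(grad)                 # push the gradient to the chief

def chief(n_updates=10, lr=1e-3):
    global meta_params
    for _ in range(n_updates):
        grad = grad_queue.get()              # gradient from some worker
        meta_params = meta_params - lr * grad

workers = [threading.Thread(target=worker, args=(i,)) for i in range(2)]
for t in workers:
    t.start()
chief()
for t in workers:
    t.join()
```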

run_single_eval.py shows how one would use a learned optimizer: sequential applications of the Learner defined in fast_rolling_mlp.py.
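
The pattern looks roughly like the loop below, reusing the hypothetical LearnedOptimizer sketched above on a toy quadratic. Since that sketch's meta-parameters are untrained, the loss will not meaningfully decrease; the point is only the sequential-application structure.

```python
# Sequentially applying the (hypothetical) LearnedOptimizer sketched above
# to a toy quadratic. With untrained meta-parameters the loss will not
# meaningfully decrease; this only illustrates the shape of the loop.
import numpy as np

rng = np.random.default_rng(0)
opt = LearnedOptimizer()
params = rng.normal(size=100)
momentum = np.zeros_like(params)

for _ in range(1000):
    grads = 2.0 * params                    # gradient of sum(params**2)
    momentum = 0.9 * momentum + grads       # feature fed to the optimizer
    params = opt.step(params, grads, momentum)

print("final loss:", float(np.sum(params ** 2)))
```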

The remaining files are helpers and utilities.