AutoML-Zero

Open source code for the paper: "AutoML-Zero: Evolving Machine Learning Algorithms From Scratch"

Introduction | Quick Demo | Reproducing Search Baselines | Citation

What is AutoML-Zero?

AutoML-Zero aims to automatically discover computer programs that solve machine learning tasks, starting from empty or random programs and using only basic math operations. The goal is to simultaneously search for all aspects of an ML algorithm (e.g., the model structure and the learning strategy) while employing minimal human bias.
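
In this setting, an algorithm is represented as three component functions, Setup, Predict, and Learn, which read and write a small virtual memory of scalars, vectors, and matrices using basic math instructions. Below is a minimal Python sketch of such a representation (the class names, op names, and addressing scheme are illustrative assumptions, not the repository's actual C++ types; matrices are omitted for brevity):

import numpy as np
from dataclasses import dataclass, field
from typing import List, Tuple

# One instruction: an op name plus the memory addresses it reads/writes.
# (Illustrative only; the real search uses a fixed table of C++ ops.)
Instruction = Tuple[str, int, int, int]  # (op, out_addr, in1_addr, in2_addr)

@dataclass
class Algorithm:
    setup: List[Instruction] = field(default_factory=list)
    predict: List[Instruction] = field(default_factory=list)
    learn: List[Instruction] = field(default_factory=list)

@dataclass
class Memory:
    """Small addressable memory shared by the three component functions."""
    scalars: np.ndarray  # s0, s1, ...
    vectors: np.ndarray  # v0, v1, ... (one row per vector address)

def execute(instructions, mem):
    """Run a straight-line list of basic math instructions."""
    for op, out, a, b in instructions:
        if op == "scalar_diff":
            mem.scalars[out] = mem.scalars[a] - mem.scalars[b]
        elif op == "scalar_product":
            mem.scalars[out] = mem.scalars[a] * mem.scalars[b]
        elif op == "vector_inner_product":
            mem.scalars[out] = mem.vectors[a] @ mem.vectors[b]
        elif op == "scalar_vector_product":
            mem.vectors[out] = mem.scalars[a] * mem.vectors[b]
        elif op == "vector_sum":
            mem.vectors[out] = mem.vectors[a] + mem.vectors[b]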

[GIF: progress of the experiment]

Despite the challenging search space, evolutionary search showed promising results by discovering linear regression, a 2-layer neural network with backpropagation, and even algorithms that outperform hand-designed baselines of comparable complexity. An example sequence of discoveries on binary classification tasks is shown above. More importantly, the evolved algorithms can be interpreted. Below is an analysis of the best evolved algorithm, which "invents" techniques such as bilinear interactions, weight averaging, normalized gradients, and adding noise to the inputs.
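
The discoveries above come from regularized (aging) evolution: repeatedly pick the best algorithm out of a small random tournament, copy and mutate it, add the child to the population, and remove the oldest member. Below is a minimal sketch of that loop, assuming the caller supplies random_algorithm, mutate, and evaluate helpers (the names and default sizes are illustrative, not the repository's actual configuration):

import random
from collections import deque

def regularized_evolution(random_algorithm, mutate, evaluate,
                          population_size=100, tournament_size=10, cycles=10000):
    """Aging evolution: the best of a random tournament is copied, mutated,
    and added; the oldest individual is removed each cycle."""
    population = deque()
    for _ in range(population_size):
        alg = random_algorithm()
        population.append((alg, evaluate(alg)))

    best_alg, best_fit = max(population, key=lambda p: p[1])
    for _ in range(cycles):
        tournament = random.sample(list(population), tournament_size)
        parent, _ = max(tournament, key=lambda p: p[1])
        child = mutate(parent)
        fit = evaluate(child)
        population.append((child, fit))
        population.popleft()  # Remove the oldest individual ("aging").
        if fit > best_fit:
            best_alg, best_fit = child, fit
    return best_alg, best_fit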

[GIF: interpretation of the best evolved algorithm]

Note that the programs shown above have been simplified and reordered for readability. The raw programs and details about additional experiments and analyses can be found in the paper.

 

5-Minute Demo: Discovering Linear Regression From Scratch

As a miniature "AutoML-Zero" experiment, let's try to automatically discover programs to solve linear regression tasks.

To get started, first install bazel following instructions here, then run the demo with:

git clone https://github.com/google-research/google-research.git
cd google-research/automl_zero
./run_demo.sh

This script runs evolutionary search on 10 linear tasks (Tsearch in the paper). After each experiment, it evaluates the best discovered algorithm on 100 new linear tasks (Tselect in the paper). Once an algorithm attains a fitness (1 - RMS error) greater than 0.9999, it is selected for a final evaluation on 100 unseen tasks. To conclude, the demo prints the results of the final evaluation and shows the code for the automatically discovered algorithm.
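
Below is a rough sketch of how such a fitness could be measured on synthetic linear tasks; the task generation and evaluation protocol here are simplified assumptions for illustration, not the demo's exact implementation:

import numpy as np

def random_linear_task(dim=4, n_train=100, n_valid=100, seed=0):
    """Generate one linear regression task: y = w . x with random w."""
    rng = np.random.default_rng(seed)
    w = rng.normal(size=dim)
    x = rng.normal(size=(n_train + n_valid, dim))
    y = x @ w
    return (x[:n_train], y[:n_train]), (x[n_train:], y[n_train:])

def fitness(predict_fn, valid_x, valid_y):
    """Fitness = 1 - RMS error on the validation examples."""
    preds = np.array([predict_fn(x) for x in valid_x])
    rmse = np.sqrt(np.mean((preds - valid_y) ** 2))
    return 1.0 - rmse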

To make this demo quick, we use a much smaller search space: only the math operations necessary to implement linear regression are allowed, and the programs are constrained to a short, fixed length. This way, the demo typically discovers programs similar to linear regression by gradient descent in under 5 minutes on 1 CPU (the runtime may vary with random seeds and hardware). We saw similar discoveries in the unconstrained search space, although at a higher compute cost.
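
With fixed-length programs, one of the simplest mutation types is to overwrite a randomly chosen instruction with a fresh random one. The sketch below illustrates that idea; the op list, memory sizes, and dictionary encoding of an algorithm are illustrative assumptions, not the demo's actual configuration:

import copy
import random

# Allowed ops and memory sizes for a constrained search space
# (illustrative values only).
OPS = ["scalar_diff", "scalar_product", "vector_inner_product",
       "scalar_vector_product", "vector_sum"]
NUM_SCALARS, NUM_VECTORS = 5, 3

def random_instruction(rng):
    """One instruction: (op, output address, two input addresses)."""
    return (rng.choice(OPS), rng.randrange(NUM_SCALARS),
            rng.randrange(NUM_VECTORS), rng.randrange(NUM_VECTORS))

def mutate(algorithm, rng=random):
    """Replace one randomly chosen instruction in one component function.
    `algorithm` is a dict: {'setup': [...], 'predict': [...], 'learn': [...]}."""
    child = copy.deepcopy(algorithm)
    component = rng.choice(["setup", "predict", "learn"])
    if child[component]:
        i = rng.randrange(len(child[component]))
        child[component][i] = random_instruction(rng)
    return child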

You can compare the automatically discovered algorithm with the solution from a human ML researcher (one of the authors):

def Setup():
  s2 = 0.001  # Init learning rate.

def Predict():  # v0 = features
  s1 = dot(v0, v1)  # Apply weights.

def Learn():  # v0 = features; s0 = label
  s3 = s0 - s1  # Compute error.
  s4 = s3 * s2  # Apply learning rate.
  v2 = v0 * s4  # Compute gradient.
  v1 = v1 + v2  # Update weights.

In this human-designed program, the Setup function establishes a learning rate, the Predict function applies a set of weights to the inputs, and the Learn function corrects the weights in the direction opposite to the gradient; in other words, a linear regressor trained with gradient descent. You may notice that evolved programs can order the instructions very differently and usually contain many redundant instructions, which can make them challenging to interpret. See the paper for more details about how we address these problems.
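
As a sanity check, the human-designed program above translates almost line for line into NumPy; the sketch below trains it on a synthetic noise-free linear task (the task setup, dimensionality, and step count are arbitrary choices for illustration):

import numpy as np

rng = np.random.default_rng(0)
dim = 4
true_w = rng.normal(size=dim)

# Setup(): init learning rate and weights.
s2 = 0.001           # learning rate
v1 = np.zeros(dim)   # weights

for step in range(10000):
    v0 = rng.normal(size=dim)   # features
    s0 = true_w @ v0            # label

    # Predict(): apply weights.
    s1 = v0 @ v1

    # Learn(): gradient step toward the label.
    s3 = s0 - s1     # error
    s4 = s3 * s2     # apply learning rate
    v2 = v0 * s4     # gradient
    v1 = v1 + v2     # update weights

print("weight error:", np.linalg.norm(v1 - true_w))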

 

Reproducing Search Baselines

First install bazel following instructions here, then run the following command to reproduce the results in Supplementary Section 9 ("Baselines") with the "Basic" method on 1 process (1 CPU):

[To be continued, ETA: March, 2020]

If you want to use more than 1 process, you will need to create your own implementation to parallelize the computation based on your particular distributed-computing platform. A platform-agnostic description of what we did is given in our paper.

This directory leaves out pre-existing upgrades used by the "Full" method (e.g., hurdles) but includes those introduced in this paper (e.g., FEC for ML algorithms).
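
FEC (functional equivalence caching) skips the full evaluation of any candidate whose behavior on a small fixed probe set has already been seen: the candidate's outputs on the probe set are hashed, and the hash keys a cache of fitness values. Below is a rough sketch of that idea; the probe set, rounding, hashing, and cache policy are simplified assumptions rather than the paper's exact procedure:

import hashlib

# Cache from behavior fingerprint -> fitness.
_fec_cache = {}

def fingerprint(algorithm, probe_examples, run_fn, digits=4):
    """Hash the algorithm's predictions on a small, fixed probe set."""
    outputs = [round(float(run_fn(algorithm, x)), digits) for x in probe_examples]
    return hashlib.sha256(repr(outputs).encode()).hexdigest()

def cached_evaluate(algorithm, probe_examples, run_fn, evaluate_fn):
    """Reuse the fitness of any previously seen, functionally equivalent algorithm."""
    key = fingerprint(algorithm, probe_examples, run_fn)
    if key not in _fec_cache:
        _fec_cache[key] = evaluate_fn(algorithm)  # Full (expensive) evaluation.
    return _fec_cache[key]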

Citation

If you use the code in your research, please cite:

TODO

 

Search keywords: machine learning, neural networks, evolution, evolutionary algorithms, regularized evolution, program synthesis, architecture search, NAS, neural architecture search, neuro-architecture search, AutoML, AutoML-Zero, algorithm search, meta-learning, genetic algorithms, genetic programming, neuroevolution, neuro-evolution.