Jeremy Bernstein · Yisong Yue
This repository contains code to compute the information content of an infinitely wide neural network. What is this information content, exactly? Well, it's the negative log probability of the orthant in function space that is consistent with the training labels, where probability is calculated with respect to the Gaussian measure on function space that is induced by the neural architecture.
For a training set with binary class labels, there are three steps to compute it (a minimal code sketch follows the list):

- For all pairs of training inputs `x_i` and `x_j`, compute the correlations between network outputs under random sampling of the network weights: `Sigma_ij := Expect_w [ f(x_i,w) f(x_j,w) ]`.
- For a random vector `z` distributed Normally with mean `0` and covariance `Sigma`, estimate the probability `p := Prob[sign z = c]`, where `c` is the binary vector of class labels.
- Return `log 1/p`.
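For concreteness, here is a minimal sketch of these steps in PyTorch, using naive Monte Carlo both to estimate the kernel `Sigma` from a finite-width random network and to estimate the orthant probability `p`. The function names, the toy ReLU network, the jitter term and the sample counts are illustrative assumptions rather than the repository's actual implementation, and plain Monte Carlo breaks down once `p` becomes very small.

```python
import torch

def estimate_kernel(x, f, num_weight_samples=10000):
    """Monte Carlo estimate of Sigma_ij = Expect_w[ f(x_i, w) f(x_j, w) ],
    where each call to f(x) evaluates a freshly sampled random network."""
    outputs = torch.stack([f(x) for _ in range(num_weight_samples)])  # (samples, n)
    return outputs.T @ outputs / num_weight_samples                   # (n, n)

def information_content(Sigma, c, num_z_samples=1_000_000):
    """Estimate log(1/p), where p = Prob[sign(z) = c] for z ~ N(0, Sigma)
    and c is a vector of +/-1 class labels."""
    n = Sigma.shape[0]
    L = torch.linalg.cholesky(Sigma + 1e-6 * torch.eye(n))  # jitter for numerical stability
    z = torch.randn(num_z_samples, n) @ L.T                  # each row is one draw of z
    p = (torch.sign(z) == c).all(dim=1).float().mean()       # fraction of draws matching the labels
    return torch.log(1.0 / p)  # naive estimator: returns inf if no draw matches

# Hypothetical toy usage: a random two-layer ReLU network on three inputs.
def random_relu_net(x, width=512):
    d = x.shape[1]
    w1 = torch.randn(d, width) / d ** 0.5
    w2 = torch.randn(width, 1) / width ** 0.5
    return (torch.relu(x @ w1) @ w2).squeeze(1)

x = torch.randn(3, 5)
c = torch.tensor([1.0, 1.0, -1.0])
Sigma = estimate_kernel(x, random_relu_net)
print(information_content(Sigma, c))
```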
And why is this interesting? Well, PAC-Bayes guarantees generalisation when the information content `log 1/p` is smaller than the number of training data points. More details are given in our paper.
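Concretely, a standard realizable-case PAC-Bayes bound (the precise statement and constants appear in the paper and may differ from this rough sketch) controls the test error of the resulting Gibbs classifier at roughly `(log 1/p + log(1/delta) + O(log n)) / n` with probability `1 - delta` over a training set of size `n`, so the guarantee only becomes meaningful when `log 1/p` is small compared to `n`.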
- Run the unit tests: `python unit_test.py`
- Run the main script: `python pac-bayes.py`
- Generate the plots using the Jupyter notebook `make_plots.ipynb`.
The code was run on:
- PyTorch 1.5.0
- the Docker container nvcr.io/nvidia/pytorch:20.03-py3
- an NVIDIA Titan RTX GPU with driver version 440.82 and CUDA version 10.2
For the exact version of the code used in arXiv:2103.01045, go back to commit 49cc144.
If you find this code useful, feel free to cite the paper:
@inproceedings{entropix,
  title={Computing the Information Content of Trained Neural Networks},
  author={Jeremy Bernstein and Yisong Yue},
  booktitle={Workshop on the Theory of Overparameterized Machine Learning},
  year={2021}
}
We are making our algorithm available under a CC BY-NC-SA 4.0 license.