OLLA (Optimizing the Lifetime and Location of Arrays) makes it possible to train larger deep neural networks on existing hardware. OLLA optimizes the order in which the operators of a neural network are executed to minimize peak memory usage. Furthermore, OLLA eliminates memory fragmentation to ensure that no memory is wasted.
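To illustrate why operator order matters for peak memory, here is a minimal sketch (not OLLA's actual algorithm or API; the ops, sizes, and simulator are invented for the example). Each operator allocates its output, and a tensor is freed once its last consumer in the schedule has run. Running two independent branches one at a time needs less memory than interleaving them:

```python
# Hypothetical toy graph: two branches (p, q) each produce a large
# temporary, reduce it, and feed a final op. Sizes are arbitrary units.
# Each entry: op name -> (inputs consumed, output tensor, output size).
OPS = {
    "p1": ([],                 "p_tmp", 8),
    "p2": (["p_tmp"],          "p_out", 1),
    "q1": ([],                 "q_tmp", 8),
    "q2": (["q_tmp"],          "q_out", 1),
    "f":  (["p_out", "q_out"], "out",   1),
}

def peak_memory(schedule):
    """Simulate a schedule: allocate each op's output, and free a
    tensor right after its last consumer in the schedule has run."""
    last_use = {}
    for step, op in enumerate(schedule):
        for t in OPS[op][0]:
            last_use[t] = step
    live, cur, peak = {}, 0, 0
    for step, op in enumerate(schedule):
        _, out, size = OPS[op]
        live[out] = size
        cur += size
        peak = max(peak, cur)
        for t, last in list(last_use.items()):
            if last == step:
                cur -= live.pop(t)
    return peak

print(peak_memory(["p1", "q1", "p2", "q2", "f"]))  # interleaved branches -> 17
print(peak_memory(["p1", "p2", "q1", "q2", "f"]))  # one branch at a time -> 10
```

Both schedules compute the same result, but the second keeps only one large temporary alive at a time. OLLA searches for such memory-minimizing orderings over the whole network.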
Our approach is described in detail in the OLLA arXiv paper.
The source code will be available soon.
If you use OLLA, please cite our paper:
@article{steiner2022olla,
  title={OLLA: Optimizing the Lifetime and Location of Arrays to Reduce the Memory Usage of Neural Networks},
  author={Steiner, Benoit and Elhoushi, Mostafa and Kahn, Jacob and Hegarty, James},
  doi={10.48550/arXiv.2210.12924},
  year={2022},
}