Commit
Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (huggingface#3463)

Release large tensors in attention (as soon as they're no longer required). Reduces peak VRAM by nearly 2 GB for 1024x1024 (even after slicing), and the savings scale up with image size.
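
The idea of dropping each large intermediate right after its last use can be sketched as follows. This is not the code from this commit: it is a pure-Python stand-in in which `Matrix`, `matmul`, `softmax_rows`, and `attention` are hypothetical names, and the "was it freed" check relies on CPython's reference counting. In a tensor library the same pattern (deleting the last reference to the score matrix before the next large allocation) is what allows that memory to be reused earlier.

```python
import math
import weakref


class Matrix(list):
    """List-of-rows matrix (hypothetical helper); a list subclass can be weakly referenced."""


def matmul(a, b):
    # Plain row-by-column product; b is transposed once into columns.
    cols = list(zip(*b))
    return Matrix([sum(x * y for x, y in zip(row, col)) for col in cols] for row in a)


def softmax_rows(m):
    # Numerically stable softmax applied to each row independently.
    out = Matrix()
    for row in m:
        peak = max(row)
        exps = [math.exp(x - peak) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def attention(q, k, v, released=None):
    """Scaled dot-product attention that drops intermediates after their last use.

    If `released` is a dict, it records whether each intermediate had already
    been freed at the point where the next one was in use (CPython assumption).
    """
    scale = 1.0 / math.sqrt(len(q[0]))
    scores = Matrix([scale * sum(x * y for x, y in zip(qi, kj)) for kj in k] for qi in q)
    scores_ref = weakref.ref(scores)

    probs = softmax_rows(scores)
    del scores  # last use was the softmax; release before the next allocation
    probs_ref = weakref.ref(probs)

    out = matmul(probs, v)
    del probs  # last use was the product with v

    if released is not None:
        released["scores"] = scores_ref() is None
        released["probs"] = probs_ref() is None
    return out


q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
released = {}
out = attention(q, k, v, released)
print(out)
print(released)
```

Without the two `del` statements, `scores` and `probs` would both stay alive until the function returns, so the peak would include the score matrix, the probabilities, and the output at the same time.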