Stars
11 stars written in Python
Fast and memory-efficient exact attention
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
[EMNLP 2024] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
Code for our NeurIPS 2022 paper
Text-writing denoising diffusion (and much more)
Fast Inference in Denoising Diffusion Models via MMD Finetuning