Stars
APISR: Anime Production Inspired Real-World Anime Super-Resolution (CVPR 2024)
Diffusion Generated Video Detection (NeurIPS 2024)
Official inference repo for FLUX.1 models
Open-Sora: Democratizing Efficient Video Production for All
Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Greatly increase the diversity of your generated images in Automatic1111 WebUI through Condition-Annealed Sampling.
[CVPR 2024] Official repository for "Tactile-Augmented Radiance Fields".
AI-Generated Images as Data Source: The Dawn of Synthetic Era
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
Unofficial implementation of Image Super-Resolution via Iterative Refinement in PyTorch
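Use ChatGPT to summarize arXiv papers. Accelerate the entire research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).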
Vision transformers with JAX & Flax (ViT, DeiT, LeViT, MAE, ConvPass)
Translated from: https://jax.readthedocs.io/en/latest
Arknights assistant: a one-click tool for the daily tasks of Arknights, supporting all clients.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)
Reading list for research topics in Masked Image Modeling
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
[ICLR 2023] This repository includes the official implementation of our paper "Can CNNs Be More Robust Than Transformers?"