Starred repositories
PyTorch code and models for the DINOv2 self-supervised learning method.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Simple, online, and realtime tracking of multiple objects in a video sequence.
A latent text-to-image diffusion model
Generative Models by Stability AI
Refine high-quality datasets and visual AI models
Official repo for consistency models.
High-Resolution Image Synthesis with Latent Diffusion Models
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
[ICLR2022] official implementation of UniFormer
[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search
😎 Awesome lists about all kinds of interesting topics
A curated list of awesome model based RL resources (continually updated)
Decision Intelligence platform for Traffic Crossing Signal Control
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
This is a collection of our NAS and Vision Transformer work.
All Algorithms implemented in Python
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
Official PyTorch implementation of StyleGAN3
Open source code for AlphaFold 2.
[TIP 2022] CBNetV2: A Composite Backbone Network Architecture for Object Detection
OpenDILab RL HPC OP Lib, including CUDA and Triton kernel