Stars
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
High-Resolution Image Synthesis with Latent Diffusion Models
Making large AI models cheaper, faster and more accessible
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Generative Models by Stability AI
Datasets, Transforms and Models specific to Computer Vision
End-to-End Object Detection with Transformers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
A Collection of Variational Autoencoders (VAE) in PyTorch.
Synthesizing and manipulating 2048x1024 images with conditional GANs
📷 EasyPhoto | Your Smart AI Photo Generator.
A concise but complete full-attention transformer with a set of promising experimental features from various papers
Count the MACs / FLOPs of your PyTorch model.
Denoising Diffusion Probabilistic Models
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
[WIP] Layer Diffusion for WebUI (via Forge)
3D ResNets for Action Recognition (CVPR 2018)
A collection of loss functions for medical image segmentation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)