Stars
Stable Diffusion web UI
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
High-Resolution Image Synthesis with Latent Diffusion Models
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
⚡ A Fast, Extensible Progress Bar for Python and CLI
Real-time face swap for PC streaming or video calls
SoftVC VITS Singing Voice Conversion
Deezer source separation library including pretrained models.
State-of-the-art 2D and 3D Face Analysis Project
Image-to-Image Translation in PyTorch
Industry leading face manipulation platform
Rembg is a tool to remove image backgrounds
A Deep Learning based project for colorizing and restoring old images (and video!)
WebUI extension for ControlNet
DeepFaceLab is the leading software for creating deepfakes.
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
Fast and memory-efficient exact attention
Bringing Old Photo Back to Life (CVPR 2020 oral)
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Super Resolution for images using deep learning.
An open source implementation of CLIP.