Starred repositories
All Algorithms implemented in Python
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Real-time face swap for PC streaming or video calls
Easily train a good VC model with voice data <= 10 mins!
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
WebUI extension for ControlNet
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
An open source implementation of CLIP.
Simple, unified interface to multiple Generative AI providers
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
High-Resolution 3D Human Digitization from A Single Image.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Count the MACs / FLOPs of your PyTorch model.
Nightly release of ControlNet 1.1
Perceptual video quality assessment based on multi-method fusion.
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project.
Direct voxel grid optimization for fast radiance field reconstruction.
📼 Package media content for online streaming(DASH and HLS) using FFmpeg
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention