Stars
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
A generative speech model for daily dialogue.
Real-time face swap for PC streaming or video calls
SoftVC VITS Singing Voice Conversion
Easily train a good VC model with voice data <= 10 mins!
Industry leading face manipulation platform
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
GUI for a Vocal Remover that uses Deep Neural Networks.
Realtime Voice Changer
Fast and memory-efficient exact attention
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Automate Creation of YouTube Shorts using MoviePy.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
High-Resolution 3D Human Digitization from A Single Image.
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution. Runs locally, with no third-party API required.