Starred repositories
The official GitHub mirror of the Chromium source
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
This repo contains source code and materials for the TEmporally COherent GAN SIGGRAPH project.
Enhancements & experiments for ComfyUI, mostly focusing on UI features
Custom nodes that extend the capabilities of ComfyUI
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples on how to use it
AI-based tool for removing hard-coded subtitles and text-like watermarks from images and videos. Outputs the cleaned image or video file at its original resolution. Runs locally; no third-party API required.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
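This project builds on SadTalkers to implement Wav2lip for video lip synthesis. It generates speech-driven lip shapes from a video file and applies configurable enhancement to the synthesized lip (face) region to improve the sharpness of the generated lips. It uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, smoothing the motion transitions between synthesized lip frames so that the result looks more fluid, realistic, and natural.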
Instant voice cloning by MIT and MyShell.
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, try Sync Labs.
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
A free and open-source inpainting & image-upscaling tool powered by WebGPU and WASM, running entirely in the browser.
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Simple and easy to use DDNS. Support Aliyun, Tencent Cloud, Dnspod, Cloudflare, Callback, Huawei Cloud, Baidu Cloud, Porkbun, GoDaddy, Namecheap, NameSilo...
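A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating SRT files. A deep-learning-based video subtitle extraction framework covering subtitle region detection and subtitle content extraction. Text recognition runs locally; no third-party API required.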
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)