Stars
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official implementation of the paper "DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion".
Official PyTorch implementation of "Expressive Whole-Body 3D Gaussian Avatar", ECCV 2024.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Link Android and PC easily! 全能手机连接助手!
Real-time image and video processing library similar to GPUImage, with built-in beauty filters, Written in C++11 and based on OpenGL/ES.
This repo includes Claude prompt curation to use Claude better.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
TinyDB is a lightweight document oriented database optimized for your happiness :)
The most advanced browser fingerprinting library.
A TikTok Clone in Flutter and Firebase.
Windows 平台的 FRP GUI 客户端 / A user-friendly desktop GUI client for FRP on Windows.
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
ComfyUI node for background removal, implementing InSPyreNet the best method up to date
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Ready-to-use and customizable users management for FastAPI
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".