Skip to content
View wentao-uw's full-sized avatar

Block or report wentao-uw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 395 25 Updated Jan 24, 2025

Memory-Guided Diffusion for Expressive Talking Video Generation

Python 685 69 Updated Jan 24, 2025

视频号、小程序、抖音、快手、小红书、直播流、m3u8、酷狗、QQ音乐等常见网络资源下载!

Go 4,566 627 Updated Jan 22, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,202 1,228 Updated Jan 22, 2025

Taming Stable Diffusion for Lip Sync!

Python 2,129 256 Updated Jan 19, 2025

Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"

Python 860 108 Updated Oct 29, 2024

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,778 1,858 Updated Jan 8, 2025

[CVPR'24] MESA: Matching Everything by Segmenting Anything

Python 118 11 Updated Jan 21, 2025

High-resolution models for human tasks.

Python 4,777 277 Updated Nov 18, 2024

数字人资料整理

616 73 Updated Jan 8, 2025

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Python 7,007 590 Updated Dec 26, 2024

Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS

JavaScript 1,421 94 Updated Jan 25, 2025

Rotation & scale invariant template matching

Python 133 33 Updated Jul 13, 2024

C++ implementation of a ScienceDirect paper "An accelerating cpu-based correlation-based image alignment for real-time automatic optical inspection"

C++ 891 210 Updated Jul 15, 2024

Adobe Font Development Kit for OpenType

C 1,073 168 Updated Jan 26, 2025

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 430 19 Updated Apr 8, 2024

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 598 99 Updated Dec 31, 2024

科技爱好者周刊,每周五发布

51,658 3,087 Updated Jan 24, 2025

real time face swap and one-click video deepfake with only a single image

Python 43,307 6,293 Updated Jan 26, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,181 464 Updated Jan 23, 2025

best way to save what you love

Svelte 27,078 2,168 Updated Jan 25, 2025

Collection of training data management explorations for large language models

305 29 Updated Aug 2, 2024

Python library for analysing faces using PyTorch

Python 530 46 Updated Nov 17, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,791 1,376 Updated Dec 25, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 64,796 6,944 Updated Jan 26, 2025

A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.

Python 1,828 110 Updated Jan 26, 2025

🔥Deep Learning for Face Anti-Spoofing

557 67 Updated Jul 11, 2023
Next