Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"

Python 860 108 Updated Oct 29, 2024

xszyou / Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,778 1,858 Updated Jan 8, 2025

Easonyesheng / A2PM-MESA

[CVPR'24] MESA: Matching Everything by Segmenting Anything

Python 118 11 Updated Jan 21, 2025

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,777 277 Updated Nov 18, 2024

YUANZHUO-BNU / metahuman_overview

数字人资料整理

616 73 Updated Jan 8, 2025

nadermx / backgroundremover

Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source.

Python 7,007 590 Updated Dec 26, 2024

shadowcz007 / comfyui-mixlab-nodes

Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS

JavaScript 1,421 94 Updated Jan 25, 2025

cozheyuanzhangde / Invariant-TemplateMatching

Rotation & scale invariant template matching

Python 133 33 Updated Jul 13, 2024

DennisLiu1993 / Fastest_Image_Pattern_Matching

C++ implementation of a ScienceDirect paper "An accelerating cpu-based correlation-based image alignment for real-time automatic optical inspection"

C++ 891 210 Updated Jul 15, 2024

adobe-type-tools / afdko

Adobe Font Development Kit for OpenType

C 1,073 168 Updated Jan 26, 2025

UX-Decoder / DINOv

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 430 19 Updated Apr 8, 2024

nomadkaraoke / python-audio-separator

Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)

Python 598 99 Updated Dec 31, 2024

ruanyf / weekly

科技爱好者周刊，每周五发布

51,658 3,087 Updated Jan 24, 2025

phh95 / Awesome-design-tools

595 59 Updated Sep 7, 2022

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 43,307 6,293 Updated Jan 26, 2025

vonage-garage-rip / AnsweringMachineDetection

Jupyter Notebook 9 21 Updated Dec 8, 2022

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 4,181 464 Updated Jan 23, 2025

imputnet / cobalt

best way to save what you love

Svelte 27,078 2,168 Updated Jan 25, 2025

ZigeW / data_management_LLM

Collection of training data management explorations for large language models

305 29 Updated Aug 2, 2024

tomas-gajarsky / facetorch

Python library for analysing faces using PyTorch

Python 530 46 Updated Nov 17, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,791 1,376 Updated Dec 25, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 64,796 6,944 Updated Jan 26, 2025

chflame163 / ComfyUI_LayerStyle

A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.

Python 1,828 110 Updated Jan 26, 2025

ZitongYu / DeepFAS

🔥Deep Learning for Face Anti-Spoofing

557 67 Updated Jul 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wentao wentao-uw

Achievements