wwwei1997

Follow

Wei wwwei1997

Follow

5 followers · 12 following

xjtu
西安

Lists (23)

Sort

3D

Audio/ASR

Audio/AudioSeparation

Audio/BaseModel

Audio/Data

13 repositories

Audio/TTS

28 repositories

Audio/VC

Image/BaseModel

Image/Detection

13 repositories

Image/reid

Image/Segmentation

ModelTraining

MultiModal/3DGen

MultiModal/BaseModel

10 repositories

MultiModal/ImageGen

29 repositories

MultiModal/TalkingHead

24 repositories

MultiModal/VideoGen

10 repositories

NeRF

NLP

13 repositories

Other

45 repositories

Python Tools

Video

Video/Data

Stars

longzw1997 / Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 543 97 Updated Jun 25, 2024

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,145 164 Updated Feb 13, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 2,877 427 Updated Jan 19, 2025

IDEA-Research / T-Rex

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Python 2,414 160 Updated Oct 21, 2024

deepseek-ai / DeepSeek-V3

Python 91,636 14,838 Updated Feb 24, 2025

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 7,202 556 Updated Feb 26, 2025

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,143 494 Updated Feb 26, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,704 2,786 Updated Mar 10, 2025

python-poetry / poetry

Python packaging and dependency management made easy

Python 32,778 2,334 Updated Mar 10, 2025

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 43,119 1,219 Updated Mar 10, 2025

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 9,152 753 Updated Mar 6, 2025

freqtrade / freqtrade

Free, open source crypto trading bot

Python 37,128 7,305 Updated Mar 10, 2025

myhhub / stock

stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。

Python 7,825 1,533 Updated Mar 3, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,208 1,397 Updated Feb 24, 2025

MRzzm / HDTF

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Python 370 69 Updated May 12, 2024

FACEGOOD / FACEGOOD-Audio2Face

http://www.facegood.cc

Python 1,856 362 Updated Feb 8, 2023

anothermartz / Easy-Wav2Lip

Forked from GucciFlipFlops1917/wav2lip-hq-updated-ESRGAN

Colab for making Wav2Lip high quality and easy to use

Jupyter Notebook 777 140 Updated May 17, 2024

yangkang2021 / I_am_a_person

实时互动的GPT数字人

Python 404 90 Updated Dec 26, 2024

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

Python 11,734 1,135 Updated Feb 16, 2025

barisgecer / GANFit

Project Page of 'GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction' [CVPR2019]

Python 646 66 Updated Nov 9, 2021

sicxu / Deep3DFaceRecon_pytorch

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.

Python 1,785 324 Updated Nov 26, 2024

uniBruce / Mead

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]

Python 256 29 Updated Jul 7, 2024

CelebV-HQ / CelebV-HQ

[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

Python 416 34 Updated Jan 4, 2023

colmap / colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

C++ 8,297 1,596 Updated Mar 10, 2025

NVlabs / instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

Cuda 16,377 1,953 Updated Jan 27, 2025

cnlinxi / book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

TeX 596 80 Updated Apr 19, 2022

yerfor / GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,588 293 Updated Oct 18, 2024

ashawkey / RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Python 912 156 Updated Apr 4, 2024

YudongGuo / AD-NeRF

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Python 1,049 179 Updated Oct 27, 2023

Fictionarry / ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,149 140 Updated Feb 28, 2025