Starred repositories
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Real-time interactive streaming digital human
🔥 2D and 3D face alignment library built using PyTorch
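A minimal usage sketch for this library (published as `face-alignment` on PyPI), assuming its documented `FaceAlignment`/`get_landmarks` API; the image path is hypothetical, and the `LandmarksType` enum member was renamed across releases, so check your installed version.

```python
# Minimal sketch of the face-alignment API; "face.jpg" is a hypothetical input.
# Note: older releases spell the enum face_alignment.LandmarksType._2D.
import face_alignment
from skimage import io

fa = face_alignment.FaceAlignment(face_alignment.LandmarksType.TWO_D, device="cpu")
image = io.imread("face.jpg")
landmarks = fa.get_landmarks(image)  # list of (68, 2) landmark arrays, one per detected face
print(landmarks[0] if landmarks else "no face detected")
```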
Industry-leading face manipulation platform
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Using a modified BiSeNet for face parsing in PyTorch
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Rhubarb Lip Sync is a command-line tool that automatically creates 2D mouth animation from voice recordings. You can use it for characters in computer games, in animated cartoons, or in any other project.
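Since Rhubarb is a command-line tool, here is a minimal sketch of driving it from Python; the `-f json` and `-o` flags follow the project's documented usage, while the binary location and file names are assumptions.

```python
# Minimal sketch: run the Rhubarb CLI and read back its JSON mouth cues.
# Assumes "rhubarb" is on PATH; "recording.wav" and the output name are hypothetical.
import json
import subprocess

subprocess.run(
    ["rhubarb", "-f", "json", "-o", "mouth_cues.json", "recording.wav"],
    check=True,
)
with open("mouth_cues.json") as f:
    cues = json.load(f)["mouthCues"]  # each cue has "start", "end", and a mouth-shape "value"
print(cues[:3])
```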
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Multilingual Voice Understanding Model
High-performance In-browser LLM Inference Engine
🤖 Components Library for Quickly Building LLM Chat Interfaces.
ChatGLM2-6B: An Open-Source Bilingual Chat LLM
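A minimal inference sketch following the project's documented Hugging Face usage; `trust_remote_code` and the `model.chat()` helper come from the ChatGLM2-6B README, and a CUDA GPU with enough memory is assumed.

```python
# Minimal sketch of ChatGLM2-6B inference via Hugging Face Transformers.
# Assumes a CUDA GPU; the prompt is arbitrary.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm2-6b", trust_remote_code=True)
model = AutoModel.from_pretrained("THUDM/chatglm2-6b", trust_remote_code=True).half().cuda()
model = model.eval()
response, history = model.chat(tokenizer, "Hello", history=[])  # chat() threads dialogue history
print(response)
```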
A Fundamental End-to-End Speech Recognition Toolkit and Open-Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
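A minimal transcription sketch assuming FunASR's `AutoModel` interface as shown in its README; the model names follow the project's examples, and the audio path is hypothetical.

```python
# Minimal sketch of FunASR's AutoModel pipeline: ASR + VAD + punctuation.
# Model names follow the README's examples; "speech.wav" is hypothetical.
from funasr import AutoModel

model = AutoModel(model="paraformer-zh", vad_model="fsmn-vad", punc_model="ct-punc")
result = model.generate(input="speech.wav")
print(result[0]["text"])  # recognized transcript with punctuation restored
```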
Production First and Production Ready End-to-End Speech Recognition Toolkit
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
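A minimal offline-transcription sketch using Vosk's Python API; `Model`, `KaldiRecognizer`, and `FinalResult` are the library's documented entry points, while the model directory and WAV file are assumptions.

```python
# Minimal sketch of offline transcription with Vosk.
# Assumes a downloaded model in ./model and a 16-bit mono PCM WAV file.
import json
import wave

from vosk import KaldiRecognizer, Model

wf = wave.open("speech.wav", "rb")
rec = KaldiRecognizer(Model("model"), wf.getframerate())
while True:
    data = wf.readframes(4000)
    if len(data) == 0:
        break
    rec.AcceptWaveform(data)  # feed audio chunks to the recognizer
print(json.loads(rec.FinalResult())["text"])
```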
Faster Whisper transcription with CTranslate2
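A minimal sketch of the faster-whisper API; `WhisperModel` and `transcribe` are the library's documented entry points, while the model size, compute type, and audio path are assumptions.

```python
# Minimal sketch of transcription with faster-whisper (CTranslate2 backend).
# Model size, device, and "speech.wav" are assumptions; adjust to your setup.
from faster_whisper import WhisperModel

model = WhisperModel("small", device="cpu", compute_type="int8")
segments, info = model.transcribe("speech.wav", beam_size=5)
print(f"Detected language: {info.language}")
for segment in segments:  # transcribe() returns a lazy generator of segments
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```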
Janus-Series: Unified Multimodal Understanding and Generation Models