Multilingual large voice generation model with full-stack inference, training, and deployment capabilities.

Python 11,193 1,102 Updated Feb 26, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 4,165 341 Updated Feb 27, 2025
Python 481 26 Updated Feb 18, 2025

Voice Recognition to Text Tool: an offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain-text formats.

Python 3,019 329 Updated Dec 5, 2024

Translate a video from one language to another and add dubbing. Also supports speech-recognition transcription, speech synthesis, and subtitle translation.

Python 11,978 1,334 Updated Feb 27, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 1,847 151 Updated Feb 10, 2025

This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.

Python 452 36 Updated Feb 21, 2025

Sonic is a method for "Shifting Focus to Global Audio Perception in Portrait Animation"; you can use it in ComfyUI.

Python 596 51 Updated Feb 25, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,508 260 Updated Feb 19, 2025

Official repository of In-Context LoRA for Diffusion Transformers

1,624 81 Updated Dec 20, 2024

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 2,946 347 Updated Feb 27, 2025

Bring portraits to life!

Python 14,180 1,525 Updated Feb 13, 2025

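Proxy service ("airport") recommendations / SSR and V2Ray node subscription services / direct-connect mirrors / tool recommendations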

10,243 1,165 Updated Feb 18, 2025

SUPIR upscaling wrapper for ComfyUI

Python 1,764 99 Updated Aug 1, 2024

Official repository for LTX-Video

Python 2,926 251 Updated Feb 16, 2025

Simplified Chinese version of ComfyUI

Python 499 35 Updated Dec 20, 2024

My ComfyUI workflows collection

5,927 550 Updated Dec 20, 2024

Official inference repo for FLUX.1 models

Python 20,468 1,437 Updated Feb 6, 2025

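🚀 One-click deployment (offline all-in-one package included)! Based on ChatTTS, with support for streaming output, random voice sampling, long-audio generation, and multi-role reading. Simple to use, with no complex installation required.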

Python 2,289 283 Updated Jul 2, 2024

ChatTTS 2K Speaker Stability Score 🥇 & Categorized by Gender and Age 👧 & Audio Preview 🔈

Python 619 34 Updated Jul 2, 2024

A generative speech model for daily dialogue.

Python 34,747 3,746 Updated Feb 18, 2025

A simple local web interface that uses ChatTTS to synthesize text into speech, and also exposes an API for external use.

Python 6,742 804 Updated Dec 9, 2024

Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)

Python 41,346 4,614 Updated Feb 27, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,425 830 Updated Jul 18, 2024