Stars
An ultra-modern minimalistic theme for SiYuan Note.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Alfred Workflow to extract annotations from PDF files.
ComfyUI nodes for Lotus depth/normal prediction
Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
for tile the image for advanced control or modification
A powerful anti-burn allowing much higher CFG scales for latent diffusion models (for ComfyUI)
ControlNet collections for Flux1-dev model, Trained by TheMisto.ai Team
ComfyUI's ControlNet Auxiliary Preprocessors
ControlNet scheduling and masking nodes with sliding context support
📚 Web app for browsing, reading and downloading eBooks stored in a Calibre database
GGUF Quantization support for native ComfyUI models
This repository offers various extension nodes for ComfyUI. Nodes here have different characteristics compared to those in the ComfyUI Impact Pack. The Impact Pack has become too large now...
A ComfyUI extension for Segment-Anything 2
Match two faces' shape before using other face swap nodes
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。
一个支持部署多种 WebUI 的 Jupyter Notebook / 支持一键部署 SD-Trainer,InvokeAI,ComfyUI,SD WebUI 的 PowerShell 脚本
This is an improvement of the original GPT-SoVITS project, mainly focusing on the api.py. This improvement provide you the ability to change the GPT weight and the SoVITS weight while using the api…
一个高自由度的端到端的可定制AI-VTuber。支持对接哔哩哔哩直播间,以智谱API作为语言基座模型,拥有意图识别、长短期记忆(直接记忆和联想记忆),支持搭建认知库、歌曲作品库,接入了当前热门的一些语音转换、语音合成、图像生成、数字人驱动项目,并提供了一个便于操作的客户端。
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)