Stars
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
For tiling the image for advanced control or modification
Desktop system for creators with a focus on simplicity, elegance, and usability. Based on FreeBSD. Less, but better!
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
This extension implements custom nodes that integrate ImageMagick into ComfyUI
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simpl…
Official repository of In-Context LoRA for Diffusion Transformers
Synchronized Translation for Videos. Video dubbing
Lifting ControlNet for Generalized Depth Conditioning
ROCm Container 6.2 with PyTorch 2.4 for ComfyUI with RX570/RX580/RX590 aka Polaris AMD GPU Support
🦙 Ollama Telegram bot, with advanced configuration
A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.
Translate the video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.
Daily Hacker News top stories. Subscribe to the Hacker News daily top stories by watching this repo.
Evaluating text-to-image/video/3D models with VQAScore
Updated Fusion-io iomemory VSL Linux (version 3.2.16) driver for recent kernels.
A repo to store files I share in videos.
Particle systems! Optical flow! Temporal masks! For ComfyUI!
An implementation of Depthflow in ComfyUI