
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Jupyter Notebook 1,719 100 Updated Oct 10, 2024

Tiles the image for advanced control or modification.

Python 362 9 Updated Nov 20, 2024

Desktop system for creators with a focus on simplicity, elegance, and usability. Based on FreeBSD. Less, but better!

2,321 57 Updated Jul 9, 2024

SOTA Open Source TTS

Python 16,940 1,264 Updated Dec 13, 2024

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Python 7,269 361 Updated Dec 10, 2024

This extension implements custom nodes that integrate ImageMagick into ComfyUI.

Python 31 3 Updated Apr 19, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

Python 481 41 Updated Jul 26, 2024

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…

Python 1,003 119 Updated Nov 26, 2024

Official repository of In-Context LoRA for Diffusion Transformers

1,316 65 Updated Nov 17, 2024

Synchronized Translation for Videos. Video dubbing

Python 915 171 Updated Oct 23, 2024

Lifting ControlNet for Generalized Depth Conditioning

Python 24 1 Updated Dec 28, 2023

ROCm Container 6.2 with PyTorch 2.4 for ComfyUI with RX570/RX580/RX590 aka Polaris AMD GPU Support

Dockerfile 8 3 Updated Oct 17, 2024

A fast multimodal LLM for real-time voice

Python 1,581 106 Updated Dec 12, 2024

🦙 Ollama Telegram bot, with advanced configuration

Python 306 84 Updated Nov 11, 2024

A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.

Python 1,644 97 Updated Dec 12, 2024

Translate a video from one language to another and add dubbing. Supports speech-recognition transcription, speech synthesis, and subtitle translation.

Python 11,030 1,240 Updated Dec 13, 2024

Daily Hacker News top stories. Subscribe to the daily top stories by watching this repo.

Python 178 12 Updated Aug 13, 2024

Evaluating text-to-image/video/3D models with VQAScore

Python 238 21 Updated Sep 9, 2024

Updated Fusion-io iomemory VSL Linux (version 3.2.16) driver for recent kernels.

C 156 28 Updated Nov 2, 2024

A repo to store files I share in my videos.

Python 45 11 Updated Dec 12, 2024
Python 8 Updated Oct 19, 2024

Particle systems! Optical flow! Temporal masks! For ComfyUI!

Python 328 20 Updated Nov 25, 2024

An implementation of Depthflow in ComfyUI

Python 191 7 Updated Oct 21, 2024