Stars
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Effortless data labeling with AI support from Segment Anything and other awesome models.
Character Animation (AnimateAnyone, Face Reenactment)
Unofficial Implementation of Animate Anyone
Efficient vision foundation models for high-resolution generation and perception.
The collection of pre-trained, state-of-the-art AI models for ailia SDK
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Unofficial implementation of InstantID for ComfyUI
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
A collection of ComfyUI custom nodes.- Awesome smart way to work with nodes!
Based on GroundingDino and SAM, use semantic strings to segment any element in an image. The comfyui version of sd-webui-segment-anything.
A distilled Segment Anything (SAM) model capable of running real-time with NVIDIA TensorRT
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!
Improved AnimateAnyone implementation that allows you to use the opse image sequence and reference image to generate stylized video
[ICCV 2023] MI-GAN: A Simple Baseline for Image Inpainting on Mobile Devices