Stars
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
This is the code repo for ICCV23 paper Virtual Try-On with Garment-Pose Keypoints Guided Inpainting
Official code for "Style Aligned Image Generation via Shared Attention"
[Image and Vision Computing (Vol.147 Jul. '24)] Interactive Natural Image Matting with Segment Anything Models
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
This repository releases the code and data for utterance rewriting in open-domain dialogues.
[Information Fusion (Vol.103, Mar. '24)] Boosting Image Matting with Pretrained Plain Vision Transformers
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…
Official PyTorch implementation of "Loss-Curvature Matching for Dataset Selection and Condensation" (AISTATS 2023)
📋 A list of open LLMs available for commercial use.
Official repo for consistency models.
Nightly release of ControlNet 1.1
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
General technology for enabling AI capabilities w/ LLMs and MLLMs
Source code for Twitter's Recommendation Algorithm
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Kandinsky 2 — multilingual text2image latent diffusion model
Stable Diffusion with Core ML on Apple Silicon
Custom Script for Automatics1111 StableDiffusion-WebUI.
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing