Stars
Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"
Search docs.voxel51.com with an LLM!
[CVPR 2024 Highlight] OpenBias: Open-set Bias Detection in Text-to-Image Generative Models
Using Low-rank adaptation to quickly fine-tune diffusion models.
Refine high-quality datasets and visual AI models
official code for "Large Language Models as Optimizers"
[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024
[ICCV 2023] Prompt-aligned Gradient for Prompt Tuning
[CVPR 2023] Feature Alignment and Uniformity for Test Time Adaptation
Unified Controllable Visual Generation Model
A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled concept data
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
Official PyTorch implementation of POEM (Partial Observation Experts Modelling) as introduced in the paper Contrastive Meta-Learning for Partially Observable Few-Shot Learning
Official repository of "Diffusion-based Image Translation using Disentangled Style and Content Representation" ( ICLR 2023 )
We developed a python UI based on labelme and segment-anything for pixel-level annotation. It support multiple masks generation by SAM(box/point prompt), efficient polygon modification and category…
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
Source code for NeurIPS 2020 paper "Meta-Learning with Adaptive Hyperparameters"
PyTorch implementation of "Learning Deep Features for Discriminative Localization"
Transformers as Meta-Learners for Implicit Neural Representations, in ECCV 2022
Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning, in ICCV 2021
Implementation of NFormer: Robust Person Re-identification with Neighbor Transformer
implement of SwiftNet:Real-time Video Object Segmentation