Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference

Python 140 9 Updated Oct 10, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 973 31 Updated Jul 31, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,214 375 Updated Dec 27, 2024

Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)

Jupyter Notebook 62 4 Updated Oct 19, 2024

Image Classification Testing with LLMs

Python 54 5 Updated Jan 18, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 20,931 2,301 Updated Aug 12, 2024

GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?

Python 209 25 Updated May 22, 2024
Python 10 4 Updated Nov 13, 2024

Code for Label Propagation for Zero-shot Classification with Vision-Language Models (CVPR2024)

Jupyter Notebook 35 3 Updated Jul 23, 2024

A curated list of resources for Partial-Multi-Label-Learning

21 5 Updated Dec 30, 2024

Exploring Structured Semantic Prior for Multi Label Recognition with Incomplete Labels [CVPR 2023]

Python 12 2 Updated Sep 23, 2023

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Python 606 65 Updated Jun 26, 2024

Multi-label Image Recognition with Partial Labels (IJCV'24, ESWA'24, AAAI'22)

Python 35 5 Updated Jul 15, 2024

[2024 ACM MM] Official PyTorch implementation of the paper "Text-Region Matching for Multi-Label Image Recognition with Missing Labels"

8 Updated Jul 23, 2024

This repo officially implements (IJCAI2024) TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt.

Python 4 Updated Sep 15, 2024
Python 2 Updated Aug 1, 2024
Python 89 8 Updated Sep 23, 2023

Code for Deep Quaternion Networks

TeX 55 11 Updated Dec 21, 2019

Implementation of SSPA

3 Updated Aug 6, 2024

Unofficial implementation of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification [ICCV'23]

Python 19 3 Updated May 30, 2024

[ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.

Python 65 1 Updated Jul 27, 2024

Collection of awesome test-time (domain/batch/instance) adaptation methods

805 55 Updated Dec 27, 2024

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simpl…

Python 1,049 127 Updated Dec 20, 2024

Model calibration in CLIP Adapters

Python 13 Updated Aug 19, 2024

[ICML 2024] Official implementation for "Image Fusion via Vision-Language Model".

Python 48 1 Updated Jul 11, 2024

Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.

Jupyter Notebook 179 11 Updated Jun 20, 2023

The official implementation of RAR

Python 78 Updated Mar 27, 2024

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".

Python 258 19 Updated Apr 3, 2024

[CVPR 2023] Efficient Frequency Domain-based Transformer for High-Quality Image Deblurring

Python 278 19 Updated Oct 9, 2023