Python 72 8 Updated Dec 18, 2024

Official PyTorch implementation of the IEEE TETCI 2024 paper LoCATe-GAT

Python 2 Updated Nov 30, 2024

A simple PyTorch implementation of CLIP model using DinoV2 and BERT

Python 10 1 Updated Sep 26, 2023

[CVPR 2024] Official implementation of the paper "Visual In-context Learning"

Python 418 19 Updated Apr 8, 2024

SAM with text prompt

Python 1,805 199 Updated Nov 19, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,411 118 Updated Jul 19, 2024

Flickr30K Entities Dataset

MATLAB 168 26 Updated Dec 23, 2018

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Python 1,062 69 Updated Nov 7, 2024

This repository contains the official implementation of the research paper, "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" CVPR 2024

Python 753 50 Updated Nov 22, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Jupyter Notebook 860 36 Updated Dec 10, 2024

Ultralytics YOLO11 🚀

Python 34,166 6,569 Updated Dec 18, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,059 1,448 Updated Aug 9, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,407 1,418 Updated Sep 5, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,348 169 Updated Aug 1, 2024

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,058 123 Updated Nov 5, 2024

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 184 6 Updated Jun 9, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,204 395 Updated Aug 7, 2024

[CVPR 2024] Official PyTorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Python 141 9 Updated May 30, 2024

[CVPR 2024] Official implementation of "Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation"

Python 286 22 Updated Nov 12, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,429 410 Updated Aug 19, 2024

A collection of CVPR 2024 papers and open-source projects

18,563 2,607 Updated Jul 4, 2024

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,213 152 Updated Dec 17, 2024

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 317 14 Updated Oct 7, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 2,475 196 Updated Dec 9, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,835 466 Updated Nov 5, 2024

crnn chinese_plate_recognition

Python 309 61 Updated Nov 25, 2024

Official implementation of the paper GEFF: Improving Any Clothes-Changing Person ReID Model using Gallery Enrichment with Face Features.

Python 65 10 Updated Apr 28, 2024

A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…

Python 1,923 346 Updated Jul 21, 2023

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 967 30 Updated Jul 31, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 983 64 Updated Oct 6, 2024