This repository contains the official implementation of the research paper "MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training" (CVPR 2024).

Python 753 50 Updated Nov 22, 2024

NVlabs / RADIO

Official repository for "AM-RADIO: Reduce All Domains Into One"

Jupyter Notebook 860 36 Updated Dec 10, 2024

ultralytics / ultralytics

Ultralytics YOLO11 🚀

Python 34,166 6,569 Updated Dec 18, 2024

WongKinYiu / yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,059 1,448 Updated Aug 9, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,407 1,418 Updated Sep 5, 2024

baaivision / EVA

EVA Series: Visual Representation Fantasies from BAAI

Python 2,348 169 Updated Aug 1, 2024

Sense-X / Co-DETR

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,058 123 Updated Nov 5, 2024

Beckschen / ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 184 6 Updated Jun 9, 2024

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,204 395 Updated Aug 7, 2024

xb534 / SED

[CVPR 2024] Official PyTorch implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.

Python 141 9 Updated May 30, 2024

w1oves / Rein

[CVPR 2024] Official implementation of "Stronger, Fewer, & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic Segmentation"

Python 286 22 Updated Nov 12, 2024

UX-Decoder / Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,429 410 Updated Aug 19, 2024

amusi / CVPR2024-Papers-with-Code

A collection of CVPR 2024 papers and open-source projects

18,563 2,607 Updated Jul 4, 2024

yformer / EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,213 152 Updated Dec 17, 2024

Haiyang-W / GiT

[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Python 317 14 Updated Oct 7, 2024

mit-han-lab / efficientvit

Efficient vision foundation models for high-resolution generation and perception.

Python 2,475 196 Updated Dec 9, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,835 466 Updated Nov 5, 2024

we0091234 / crnn_plate_recognition

crnn chinese_plate_recognition

Python 309 61 Updated Nov 25, 2024

bar371 / GEFF

Official implementation of the paper GEFF: Improving Any Clothes-Changing Person ReID Model using Gallery Enrichment with Face Features.

Python 65 10 Updated Apr 28, 2024

tinyvision / SOLIDER

A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…

Python 1,923 346 Updated Jul 21, 2023

HarborYuan / ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 967 30 Updated Jul 31, 2024

OFA-Sys / ONE-PEACE

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 983 64 Updated Oct 6, 2024

2016xjtuzyt

Stars

tgf123 / YOLOv8_improve

sandipan211 / LoCATe-GAT

NMS05 / DinoV2-BERT-CLIP

UX-Decoder / DINOv

luca-medeiros / lang-segment-anything

UX-Decoder / Semantic-SAM

BryanPlummer / flickr30k_entities

siyuanliii / masa

apple / ml-mobileclip