Stars
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Visual tracking library based on PyTorch.
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
Protect your eyes from eye strain using this simple and beautiful, yet extensible break reminder
[ECCV2024 Oral🔥] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
Official code for Zero-Reference Low-Light Enhancement via Physical Quadruple Priors (CVPR-24)
FFCV: Fast Forward Computer Vision (and other ML workloads!)
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
"Retinexformer: One-stage Retinex-based Transformer for Low-light Image Enhancement" (ICCV 2023) & (NTIRE 2024 Challenge)
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Learners"
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
code for CVPR2024 paper: DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Object Recognition as Next Token Prediction (CVPR 2024 Highlight)
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
[ICCV 2023] FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light Vision
[WACV 2023] BoxMask: Revisiting Bounding Box Supervision for Video Object Detection
[ICCV 2023 oral] Official repository of the paper "Similarity Min-Max: Zero-Shot Day-Night Domain Adaptation"
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Repository containing a list of labelled/unlabelled nighttime datasets
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
We write your reusable computer vision tools. 💜
Official implementation of "Implicit Neural Representations with Periodic Activation Functions"
Meta-Transformer for Unified Multimodal Learning
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥