Skip to content
View hao416's full-sized avatar

Block or report hao416

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,245 112 Updated Dec 20, 2023

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 57,812 5,898 Updated Aug 24, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 26,908 3,397 Updated Jul 23, 2024

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,331 266 Updated Jul 31, 2024

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,129 86 Updated Oct 21, 2024

深度学习经典、新论文逐段精读

27,790 2,481 Updated Nov 17, 2024

《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning

HTML 17,564 3,583 Updated Oct 7, 2022

Deep Learning Book Chinese Translation

TeX 35,955 9,114 Updated Dec 3, 2019

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,166 722 Updated Aug 12, 2024

Natural Language Processing Tutorial for Deep Learning Researchers

Jupyter Notebook 14,380 3,951 Updated Feb 21, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,481 856 Updated Jan 6, 2025

(TPAMI 2024) A Survey on Open Vocabulary Learning

875 50 Updated Dec 10, 2024

Collection of AWESOME vision-language models for vision tasks

2,696 228 Updated Dec 3, 2024

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1,148 105 Updated Aug 19, 2022

Reading list for research topics in multimodal machine learning

6,199 859 Updated Aug 20, 2024

计算机视觉相关综述。包括目标检测、跟踪........

1,973 244 Updated Jan 10, 2025

awesome grounding: A curated list of research papers in visual grounding

1,044 98 Updated Apr 9, 2023

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 500 79 Updated Jun 25, 2024

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Python 2,070 215 Updated Aug 15, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,472 27,526 Updated Jan 11, 2025

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Python 14,179 2,082 Updated Jul 24, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 85,772 23,091 Updated Jan 12, 2025

OpenMMLab Detection Toolbox and Benchmark

Python 30,018 9,505 Updated Aug 21, 2024

Flops counter for convolutional networks in pytorch framework

Python 2,848 306 Updated Sep 27, 2024

Model analyzer in PyTorch

Python 1,469 142 Updated Mar 19, 2023

The dataset for drone based detection and tracking is released, including both image/video, and annotations.

1,374 165 Updated Sep 24, 2023

deep learning for image processing including classification and object-detection etc.

Python 23,788 8,068 Updated Jan 12, 2025

Count the MACs / FLOPs of your PyTorch model.

Python 4,937 529 Updated Jul 8, 2024

2021秋招 计算机视觉算法岗面经整理——包含实习和校招等 内推整理

780 120 Updated May 11, 2021
Next