Stars
A feature-rich command-line audio/video downloader
Tensors and Dynamic neural networks in Python with strong GPU acceleration
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Anomaly detection related books, papers, videos, and toolboxes
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Most popular metrics used to evaluate object detection algorithms.
Torchreid: Deep learning person re-identification in PyTorch.
Fast and accurate automatic speech recognition (ASR) for edge devices
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
PyTorch open-source toolbox for unsupervised or domain adaptive object re-ID.
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
Resources for our paper "Low-Light Image and Video Enhancement: A Comprehensive Survey and Beyond"