Stars
PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
[ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal Data
Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
⚡ A template for rapid & flexible DL experimentation development, built upon Lightning & Hydra with best practices.
🛠 A lite C++ toolkit of 100+ awesome AI models, supporting ORT, MNN, NCNN, TNN, and TensorRT. 🎉🎉
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
An open source implementation of CLIP.
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
The easiest way to visualize your computer vision models' inferences.
A C++14-compatible physical units library with no dependencies and a single-file delivery option. Emphasis on safety, accessibility, performance, and developer experience.
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
MoViNets PyTorch implementation: Mobile Video Networks for Efficient Video Recognition.
A playbook for systematically maximizing the performance of deep learning models.
TensorDict is a PyTorch-dedicated tensor container.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
OpenMMLab Detection Toolbox and Benchmark
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
Multi-task YOLOv5 with detection and segmentation
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Feathr – A scalable, unified data and AI engineering platform for enterprise
Simple image captioning model