Stars
Using pytorch to implement MobileViT from Apple framework
A generative world for general-purpose robotics & embodied AI learning.
Official implementation of PointBeV: A Sparse Approach to BeV Predictions
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)
[ICCV2023] Official Implementation of "UniTR: A Unified and Efficient Multi-Modal Transformer for Bird’s-Eye-View Representation"
PyTorch implementation of MobileNetV4 family
Techniques for deep learning with satellite & aerial imagery
A Python program for downloading satellite imagery by geographic coordinates
Transportation planning and traffic simulation software for creating cities friendlier to walking, biking, and public transit
Epipolar Transformers (best paper award, CVPR 2020 workshop)
A curated list of awesome HD map construction methods
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
Source Code for Paper "OrienterNet Visual Localization in 2D Public Maps with Neural Matching"
SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images. http://panoptic-bev.cs.uni-freiburg.de
Source Code for "Map It Anywhere (MIA): Empowering Bird’s Eye View Mapping using Large-scale Public Data"
Sirlanri / Efficientvit
Forked from mit-han-lab/efficientvitEfficientViT is a new family of vision models for efficient high-resolution vision.
Efficient vision foundation models for high-resolution generation and perception.
Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention