Stars
Magnificent app which corrects your previous console command.
Rich is a Python library for rich text and beautiful formatting in the terminal.
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Convert Machine Learning Code Between Frameworks
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
OpenMMLab's next-generation platform for general 3D object detection.
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Python package for the evaluation of odometry and SLAM
StudioGAN is a Pytorch library providing implementations of representative Generative Adversarial Networks (GANs) for conditional/unconditional image generation.
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
Papers and Datasets about Point Cloud.
Pytorch framework for doing deep learning on point clouds.
SECOND for KITTI/NuScenes object detection
PyTorch implementation of Pointnet2/Pointnet++
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
OpenMMLab 3D Human Parametric Model Toolbox and Benchmark
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
[CVPR 2022 Oral, Best Student Paper] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Rank 1st in the leaderboard of SemanticKITTI semantic segmentation (both single-scan and multi-scan) (Nov. 2020) (CVPR2021 Oral)