- The University of Texas at Austin
- Austin, TX
Stars
AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions
Label your dataset with active learning in FiftyOne!
Perform visual question answering on your images
Rich-cli is a command line toolbox for fancy output in the terminal
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
[CVPR2023] Official Implementation of "DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets"
Here's how to use Llama 3 for beginners and what services are being used.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The devkit of the nuScenes dataset.
nutonomy / second.pytorch
Forked from traveller59/second.pytorch
PointPillars for KITTI object detection
OpenMMLab's next-generation platform for general 3D object detection.
[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection & [ICCV2023] PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Face analysis tools for modern research, equipped with state-of-the-art Face Parsing and Face Alignment
An easy-to-use Python library for processing and manipulating 3D point clouds and meshes.
A Python tool for fitting primitive 3D shapes in point clouds using the RANSAC algorithm
Images to inference with no labeling (use foundation models to train supervised models).
A Graph-Based Approach for Category-Agnostic Pose Estimation [ECCV 2024]
[CVPR 2023] BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization, CVPR 2022
Rank 1st in the leaderboard of SemanticKITTI semantic segmentation (both single-scan and multi-scan) (Nov. 2020) (CVPR2021 Oral)
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything