Stars
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
A generative world for general-purpose robotics & embodied AI learning.
Utilities intended for use with Llama models.
A search engine dedicated to CS conferences. It provides useful filters for conferences and year range.
[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"
CUDA accelerated rasterization of gaussian splatting
a collection of AWESOME things about Optimal Transport in Deep Learning
A Unified Library for Parameter-Efficient and Modular Transfer Learning
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
3D Occupancy Prediction Benchmark in Autonomous Driving
3D Point Cloud Annotation Platform for Autonomous Driving
[ICRA2023] CoAlign: Robust Collaborative 3D Object Detection in Presence of Pose Errors
[CVPR2023 Highlight] The official codebase for paper "V2V4Real: A large-scale real-world dataset for Vehicle-to-Vehicle Cooperative Perception"
[ICCV 2023] OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
Model interpretability and understanding for PyTorch
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
This repository is a paper digest of recent advances in collaborative / cooperative / multi-agent perception for V2I / V2V / V2X autonomous driving scenario.
Benchmarking Generalized Out-of-Distribution Detection
[CVPR 2022 Oral] Code release for "Causality Inspired Representation Learning for Domain Generalization"
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
Cross-platform, customizable ML solutions for live and streaming media.
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch
[TMLR 2022] High-Modality Multimodal Transformer
Unified multimodal classifier: a unified brain graph classification model trained on unpaired multimodal brain graphs, which can classify any brain graph of any size.
[NeurIPS 2021] Moment-DETR code and QVHighlights dataset
UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.