InverseMatrixVT3D

InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction [Paper]

Abstract

This paper introduces InverseMatrixVT3D, an efficient method for transforming multi-view image features into 3D feature volumes for 3D semantic occupancy prediction. Existing methods for constructing 3D volumes often rely on depth estimation, device-specific operators, or transformer queries, which hinders the widespread adoption of 3D occupancy models. In contrast, our approach leverages two projection matrices to store the static mapping relationships and matrix multiplications to efficiently generate global Bird's Eye View (BEV) features and local 3D feature volumes. Specifically, we achieve this by performing matrix multiplications between multi-view image feature maps and two sparse projection matrices. We introduce a sparse matrix handling technique for the projection matrices to optimise GPU memory usage. Moreover, a global-local attention fusion module is proposed to integrate the global BEV features with the local 3D feature volumes to obtain the final 3D volume. We also employ a multi-scale supervision mechanism to enhance performance further. Comprehensive experiments on the nuScenes dataset demonstrate the simplicity and effectiveness of our method.

Method Pipeline

Getting Started

Visualization

Acknowledgement

Many thanks to these excellent projects:

Bibtex

If this work is helpful for your research, please consider citing the following BibTeX entry.

@article{ming2024inversematrixvt3d,
  title={InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction},
  author={Ming, Zhenxing and Berrio, Julie Stephany and Shan, Mao and Worrall, Stewart},
  journal={arXiv preprint arXiv:2401.12422},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
Figs		Figs
configs		configs
docs		docs
inversematrixvt3d		inversematrixvt3d
tools		tools
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

InverseMatrixVT3D

Abstract

Method Pipeline

Getting Started

Visualization

Acknowledgement

Bibtex

About

Releases

Packages

Languages

DanielMing123/InverseMatrixVT3D

Folders and files

Latest commit

History

Repository files navigation

InverseMatrixVT3D

Abstract

Method Pipeline

Getting Started

Visualization

Acknowledgement

Bibtex

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages