Skip to content

A Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion Architecture for LiDAR-Based Collective Perception

License

Notifications You must be signed in to change notification settings

ekut-es/MR3D-Net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception

by Sven Teufel, Jörg Gamerdinger, Georg Volk, and Oliver Bringmann

Overview

This repository contains the official code for the ITSC 2024 paper MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception. MR3D-Net is a multi-resolution sparse voxel grid fusion architecture for LiDAR-based collective perception. MR3D-Net achieves state-of-the-art results on the OPV2V Benchmark while reducing the required bandwidth by up to 94%. As an environment representation MR3D-Net soley relies on sparse voxel grids at different resolutions that can be dynamically assigned to adapt to the limitations in wireless communication. Sparse voxel grids can not only drastically reduce the required bandwidth, but also provide a unified and interchangable representation of the environment.

MR3D-Net Architecture

Getting Started

Requirements

This code was tested on Linux with

  • Python 3.11
  • PyTorch 2.1.2
  • CUDA 11.8

Installation

Create a virtual environment

python3.11 -m venv mr3d_net

and activate it by running

source mr3d_net/bin/activate

clone this repository

git clone https://github.com/ekut-es/MR3D-Net.git
cd MR3D-Net

Install dependecies by running

pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/cu118
pip install torch-scatter -f https://data.pyg.org/whl/torch-2.1.2+cu118.html
pip install spconv-cu118
pip install -r requirements.txt

Install this repository by running

python setup.py develop

In case CUDA is not found set CUDA_HOME before running setup.py

export CUDA_HOME=/usr/local/cuda-11.8

Data Preparation

Download OPV2V and organize it as follows:

MR3D-Net
├── data
│   ├── OPV2V
│   │   │── train
│   │   │── validate
│   │   │── test

Generate the dataset infos by running

python pcdet/datasets/opv2v/openv2v_dataset.py

Training

MR3D-Net can be configured in: /tools/cfgs/opv2v_models/mr3d-net.yaml

The voxel grid resolutions as well as the assignment method can be configured in: /tools/cfgs/dataset_configs/opv2v_dataset.yaml

To train MR3D-Net run

cd tools
python train_mr3d_net.py --extra_tag "name of experiment"

Testing

To test MR3D-Net run

python test.py --ckpt "path to checkpoint"

For more details on training and testing configuration please have a look at train.py and test.py

MR3D-Net Results on OPV2V

Car [email protected] Bandwidth@10Hz [Mbit/s] Model Checkpoint
No Fusion 76.7 - download
Low Resolution 82.1 4.3 download
Medium Resolution 83.2 8.8 download
High Resolution 83.9 14.4 download
Dynamic Resolution 82.4 9.2 download
Mean Features (Dynamic Resolution) 81.7 42.2 download

Single Resolution 3D Net (SR3D-Net)

We also provide a more lightweight backbone for non-dynamic single resolution sparse voxel grid fusion with only one input stream at the respective resolution called SR3D-Net. SR3D-Net achieves similar results as MR3D-Net with single resolutions. SR3D-Net can be configured in cfgs/opv2v_models/sr3d-net.yaml.

To train MR3D-Net run

cd tools
python train_mr3d_net.py --cfg_file "cfgs/opv2v_models/sr3d-net.yaml" --extra_tag "name of experiment"

Testing

To test MR3D-Net run

python test.py --cfg_file "cfgs/opv2v_models/sr3d-net.yaml" --ckpt "path to checkpoint"

For more details on training and testing configuration please have a look at train.py and test.py

MR3D-Net Results on OPV2V

Car [email protected] Bandwidth@10Hz [Mbit/s] Model Checkpoint
High Resolution 83.6 14.4 download

Citation

If you find this project useful in your research, please consider citing:

@misc{teufel2024mr3dnet,
      title={MR3D-Net: Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion for LiDAR-Based Collective Perception}, 
      author={Sven Teufel and Jörg Gamerdinger and Georg Volk and Oliver Bringmann},
      year={2024},
      eprint={2408.06137},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2408.06137}, 
}

OpenPCDet

This project is basen upon OpenPCDet
If you find this project useful, please consider also citing:

@misc{openpcdet2020,
    title={OpenPCDet: An Open-source Toolbox for 3D Object Detection from Point Clouds},
    author={OpenPCDet Development Team},
    howpublished = {\url{https://github.com/open-mmlab/OpenPCDet}},
    year={2020}
}

About

A Dynamic Multi-Resolution 3D Sparse Voxel Grid Fusion Architecture for LiDAR-Based Collective Perception

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published