This is a clone of the MonoRec repository with changes to run inference and train on EuRoC-style datasets, specifically the TUM-VI dataset.
The TUM-VI dataset is a visual-inertial dataset containing sequences recorded from a handheld stereo setup with two fisheye cameras. The images are grayscale. It also provides synchronized IMU data (gyro and accel).
The primary additions are:
- Custom dataloader for tum-vi/euroc-format datasets
- Alternate script for viewing pointclouds using the Open3D library
The tum-vi dataloader expects the dataset to be laid out in the following format:
```
dataset-dir
├── 00
│   ├── basalt_keyframe_data
│   │   ├── keypoints
│   │   ├── keypoints_viz
│   │   └── poses
│   ├── dso
│   │   ├── cam0
│   │   │   └── images -> ../../mav0/cam0/data
│   │   └── cam1
│   │       └── images -> ../../mav0/cam1/data
│   └── mav0
│       ├── cam0
│       │   └── data
│       ├── cam1
│       │   └── data
│       ├── imu0
│       └── mocap0
├── 01
...
```
The overall pipeline of dataloading goes as follows:
- Load camera intrinsics for each sequence
- Format the intrinsics according to the target image size
- Load the poses, left stereo images, right stereo images and sparse depth keypoints
- The primary key is the poses, i.e. only those timestamps for which a keyframe pose is available are included in the dataset
- Poses are loaded and stored directly in memory on initialization
- Stereo image and keypoint paths are stored on initialization, and the files are only read from disk during the `__getitem__()` call (see the sketch after this list)
- accessing images:
  - the grayscale image is converted to a 3-channel image
  - the image is first resized (if applicable) and then cropped to the target image size
- accessing keypoints:
  - the `.txt` file containing the keypoints is read
  - invalid entries are discarded, i.e. NaNs or indices out of bounds of the original image size
  - the keypoints are scaled according to the target image size and added to a depth tensor
  - the depth tensor is cropped to the target image size
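The following sketch illustrates this loading pattern. The class and helper names, file formats (4x4 pose matrices, `u v depth` keypoint rows), default sizes, and the uniform resize factor are assumptions for illustration, not the exact tum-vi dataloader implementation:

```python
import numpy as np
import torch
from PIL import Image
from torch.utils.data import Dataset


class TumViSketchDataset(Dataset):
    """Minimal sketch of the dataloading pattern described above."""

    def __init__(self, pose_files, image_paths, keypoint_paths,
                 orig_size=(512, 512), target_size=(256, 256)):
        # Poses are the primary key: only timestamps with a keyframe pose
        # are indexed. Poses are small, so they are loaded into memory here.
        self.poses = {ts: np.loadtxt(f).reshape(4, 4)
                      for ts, f in pose_files.items()}
        # Images and keypoints are large, so only their paths are stored;
        # the files themselves are read lazily in __getitem__.
        self.image_paths = image_paths          # dict: timestamp -> path
        self.keypoint_paths = keypoint_paths    # dict: timestamp -> path
        self.timestamps = sorted(self.poses)
        self.orig_size, self.target_size = orig_size, target_size

    def __len__(self):
        return len(self.timestamps)

    def __getitem__(self, index):
        ts = self.timestamps[index]
        image = load_image(self.image_paths[ts], self.orig_size, self.target_size)
        depth = load_sparse_depth(self.keypoint_paths[ts], self.orig_size,
                                  self.target_size)
        pose = torch.from_numpy(self.poses[ts]).float()
        return image, depth, pose


def load_image(path, orig_size, target_size):
    """Load a grayscale image, make it 3-channel, resize, then centre-crop."""
    img = Image.open(path).convert("RGB")        # grayscale -> 3 channels
    scale = target_size[0] / orig_size[0]        # assumed uniform resize factor
    img = img.resize((round(img.width * scale), round(img.height * scale)))
    left = (img.width - target_size[1]) // 2     # horizontal centre-crop
    img = img.crop((left, 0, left + target_size[1], target_size[0]))
    arr = np.asarray(img, dtype=np.float32) / 255.0
    return torch.from_numpy(arr).permute(2, 0, 1)  # HWC -> CHW


def load_sparse_depth(path, orig_size, target_size):
    """Read keypoints from a .txt file and build a sparse depth map."""
    kps = np.atleast_2d(np.loadtxt(path))        # assumed columns: u, v, depth
    scale = target_size[0] / orig_size[0]
    resized = (round(orig_size[0] * scale), round(orig_size[1] * scale))
    depth = torch.zeros(1, *resized)
    for u, v, d in kps:
        # Discard invalid entries: NaNs or coordinates outside the original image.
        if np.isnan([u, v, d]).any():
            continue
        if not (0 <= u < orig_size[1] and 0 <= v < orig_size[0]):
            continue
        uu, vv = int(u * scale), int(v * scale)  # scale keypoints to resized grid
        if uu < resized[1] and vv < resized[0]:
            depth[0, vv, uu] = d
    # Crop the depth map to the target size, matching the image crop.
    left = (resized[1] - target_size[1]) // 2
    return depth[:, :target_size[0], left:left + target_size[1]]
```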
**Note:** Python dictionaries have been used for the above implementation. Good references for effective dataloader implementations: [ref1] [ref2]
The rgbd2pcl.py script is used to generate and view pointclouds from the keyframe, predicted depth, camera intrinsics and extrinsics. It also saves the keyframes and the predicted depth maps in the save directory specified in the config file (useful for debugging). It uses Open3D for this. [ref1][ref2]
Make sure to activate the conda environment (the MonoRec environment with Open3D installed):
conda activate pcl
E.g.
python3 rgbd2pcl.py --config configs/test/pointcloud_monorec_euroc.json
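At its core, the Open3D conversion from a keyframe plus predicted depth to a pointcloud looks roughly like the sketch below. The file names, depth encoding (16-bit PNG in millimetres), and intrinsic values are assumptions; see rgbd2pcl.py for the actual logic:

```python
import numpy as np
import open3d as o3d

# Assumed inputs: a keyframe image, its predicted depth map, pinhole
# intrinsics, and a 4x4 camera-to-world pose (from the config in the
# real script).
color = o3d.io.read_image("keyframe.png")
depth = o3d.io.read_image("depth.png")   # assumed 16-bit, millimetres

rgbd = o3d.geometry.RGBDImage.create_from_color_and_depth(
    color, depth, depth_scale=1000.0, depth_trunc=10.0,
    convert_rgb_to_intensity=False)

intrinsic = o3d.camera.PinholeCameraIntrinsic(
    width=512, height=512, fx=190.0, fy=190.0, cx=256.0, cy=256.0)

pcd = o3d.geometry.PointCloud.create_from_rgbd_image(rgbd, intrinsic)
cam_to_world = np.eye(4)                 # replace with the keyframe extrinsics
pcd.transform(cam_to_world)

o3d.io.write_point_cloud("keyframe.ply", pcd)  # e.g. for CloudCompare
o3d.visualization.draw_geometries([pcd])
```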
The example-tumvi folder can be used to test the forward pass using the tum-vi dataloader. The test_monorec.py script can be used to test inference on an entire dataset, i.e. with multiple sequences, and test_monorec_seq.py can be used to test inference on a single sequence.
Make sure to activate the conda environment for both inference and training using:
conda activate monorec
Usage:
python3 test_monorec.py
**Note:** set `pretrain_mode=1` to evaluate only the depth module, without using the mask module.
To evaluate the model, a pointcloud can be generated. CloudCompare was used for viewing the generated pointclouds. Either rgbd2pcl.py or create_pointcloud.py can be used. Usage of rgbd2pcl.py is mentioned above.
Usage for create_pointcloud.py:
python create_pointcloud.py --config configs/test/pointcloud_monorec_tumvi.json
**Note:**
- Changing the Ubuntu GUI mode can improve speed during training [ref1][ref2]
- Good practices for training on multiple GPUs: [ref]
Run the following commands:
python train.py --config configs/train/monorec/monorec_depth_tumvi.json --options stereo # Depth Bootstrap
python train_monorec.py --config configs/train/monorec/monorec_mask_tumvi.json --options stereo # Mask Bootstrap
python train_monorec.py --config configs/train/monorec/monorec_mask_ref_tumvi.json --options mask_loss # Mask Refinement
python train_monorec.py --config configs/train/monorec/monorec_depth_ref_tumvi.json --options stereo stereo_repr # Depth Refinement
To monitor the training using TensorBoard, set the `tensorboard` parameter to `true` in the config, and run the command below in a separate terminal:
MonoRec$ tensorboard --logdir=saved/log/monorec_depth/00
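Assuming the configs follow the pytorch-template layout that MonoRec is built on, the relevant entry sits in the `trainer` section:

```json
{
    "trainer": {
        "tensorboard": true
    }
}
```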
Some hyperparameters needed to be tuned differently for the TUM-VI dataset (and the dataset recorded using the RealSense) from the values used in the paper for the KITTI dataset.
- The `inv_depth_min_max` parameter must be set to (1.0, 0.0025) for training, as the dataset has been recorded using a hand-held device as opposed to a device mounted on a car (KITTI).
- The `step_size` and `gamma` parameters of the `lr_scheduler` must be properly tuned keeping in mind the size of the dataset.
- The `alpha` parameter, which is responsible for assigning weight to the `sparse_depth_loss` and the `self_supervision_loss` (a combination of `photometric_inconsistency_cv` and `edge_aware_smoothness_loss`), must be set properly after observing the intermediate results during training.
- The `num_workers` and `batch_size` parameters must be set considering the compute power, the size of the dataset, etc. [ref1] [ref2]
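For illustration only, these settings might appear in the config roughly as follows. The key paths and all numeric values except `inv_depth_min_max` are placeholders to be tuned, not recommendations:

```json
{
    "data_loader": {
        "args": {
            "inv_depth_min_max": [1.0, 0.0025],
            "batch_size": 8,
            "num_workers": 4
        }
    },
    "lr_scheduler": {
        "type": "StepLR",
        "args": {
            "step_size": 20,
            "gamma": 0.5
        }
    }
}
```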