Realistic endoscopic illumination modelling for NeRF-based data generation
You can download a copy of the corresponding MICCAI-23 paper from here
Important
Model weights have been added, see [ Download the pre-trained models](# download-the-pre-trained-models). C3VD trajectory paths to allow users to render datasets without downloading a copy of C3VD can be found under /data
C3VD updated its data and naming format. The REIM-NeRF was developed using the initial version of C3VD and therefore we follow the old naming convention. To allow users processing the new format of C3VD we have updated the pre-processing code to process both C3VD version and default to the latest format. If you have difficulties recreating the paper's results please read issue #1 of the repository
Expanding training and evaluation data is a major step towards building and deploying reliable localization and 3D reconstruction techniques during colonoscopy screenings. However, training and evaluating pose and depth models in colonoscopy is hard as available datasets are limited in size. This paper proposes a method for generating new pose and depth datasets by fitting NeRFs in already available colonoscopy datasets. Given a set of images, their associated depth maps and pose information, we train a novel light source location-conditioned NeRF to encapsulate the 3D and color information of a colon sequence. Then, we leverage the trained networks to render images from previously unobserved camera poses and simulate different camera systems, effectively expanding the source dataset. Our experiments show that our model is able to generate RGB images and depth maps of a colonoscopy sequence from previously unobserved poses with high accuracy.
The following script to downloads and extracts:
./ckpts/c3vd/main_results_iter15000_w270_h216/
.From the root directory of the repository, run:
python -m scripts/download_supporting_files.py
We host additional files supporting the paper in UCL's research repository. You can download a copy of all the pre-trained models from here and all normalized C3VD pose sequences from [here]() (data are uploaded to github until UCL storage approval is granted. )
If you want to render C3VD sequences without training models nor pre-processing C3VD, download an extract the data above either using the script or manually and continue reading from Render endoscopic sequences.
To train and test our models you need to have a copy of the registered videos of C3VD and based on it, generate the following:
Download a copy of all registered videos .zip files from C3VD and extract them under a single directory. After downloading and extracting all datasets, your local copy directory tree should look like this:
c3vd_registered_videos_raw
├── trans_t1_a_under_review
│ └── ...
├── trans_t2_b_under_review
│ ├── 0000_color.png
│ ├── 0000_depth.tiff
│ ├── 0000_normals.tiff
│ ├── 0000_occlusion.png
│ ├── ...
│ ├── coverage_mesh.obj
│ └── pose.txt
└── ...
Run the provided preprocessing script which will generate the training, evaluation and test datasets
python -m scripts.pre-process_c3vd\
{path_to_c3vd_registered_videos_raw dir}\
{path_to_a_directory_to_store_the_data}
The rest of the parameters should remain default.
By default the script is set to process the latest C3VD format ( as of 11/23). If you are using the under review version of C3VD, which was used to develop REIM-NeRF, append the above command with --old_C3VD
. Additionally if you want to overwrite existing data add the --overwrite
flag.
After both steps are complete, the process dataset should have the following structure
processed
├── trans_t1_a/(t1v1 for the new version or trans_t1_a_under_review)
│ ├── images
│ │ ├── 0000_color.png
│ │ ├── 0001_color.png
│ │ └── ....
│ ├── distmaps
│ │ ├── 0000_color.png
│ │ ├── 0001_color.png
│ │ └── ....
│ ├── rgb_mask.png
│ ├── transforms_test.json
│ ├── transforms_train.json
│ ├── transforms_true_test.json
│ └── transforms_val.json
└── ....
We provide scripts to train all variants of models presented in the paper, across all C3VD sequences. This allows readers to recreate the ablation study presented in the paper.
train_nerf.sh
, train_nerf_depth.sh
, train_nerf_plus_light-source.sh
, train_reim-nerf.sh
under REIM-NeRF/scripts/bash/c3vd/training
, by replacing the placeholder value of variable dataset_root_dir
with the path of the root directory of the pre-processed C3VD dataset (generated in step 1).train_all.sh
: trains all models. It runs the following 4 scripts in sequencetrain_nerf.sh
: trains only the vanilla nerftrain_nerf_depf.sh
: trains only the vanilla nerf with sparse depth supervisiontrain_nerf_plus_light-source.sh
: trains the light-source location conditioned model.train_reim-nerf.sh
: trains the light-source conditioned model with sparse depth supervision. This is our full modelThe provided scripts assume will work with models and trajectories downloaded using the provided scripts. If you download the models and trajectory paths using the provided python script, skip steps 1 and 2, the hardcoded paths in the scripts of step 3 should work.
inference_nerf.sh
, inference_nerf_depth.sh
, inference_nerf_plus_light-source.sh
, inference_reim-nerf.sh
under REIM-NeRF/scripts/bash/c3vd/inference
, by replacing the placeholder value of variable sequences_root_dir
with the path of the root directory of the pre-processed C3VD dataset (generated in step 1). Furthermore, replace the checkpoints_root_dir
with the root directory of saved models for C3VD for each of the models.inference_all.sh
: run inference on C3VD with all models. This script runs the following 4 scripts in sequenceinference_nerf.sh
: Inference with the original nerf modelinference_nerf_depf.sh
: Inference with the original nerf model, trained with sparse depth supervisioninference_nerf_plus_light-source.sh
: Inference with the light-source location conditioned model.inference_reim-nerf.sh
: Inference with the light-source location conditioned model, trained with sparse depth supervision. This is our full modelscripts/bash/c3vd/evaluate/evaluate_template.sh
, by changing the placeholders variables paths. dataset_root_dir
should point to the pre-processed version of C3VD and predictions_root_dir
should point to the root directory containing checkpoints for all c3vd dataset of a specific model variant.This research was funded, in whole, by the Wellcome/EPSRC Centre for Interventional and Surgical Sciences (WEISS) [203145/Z/16/Z]; the Engineering and Physical Sciences Research Council (EPSRC) [EP/P027938/1, EP/R004080/1, EP/P012841/1]; the Royal Academy of Engineering Chair in Emerging Technologies Scheme, and Horizon 2020 FET (863146). For the purpose of open access, the author has applied a CC BY public copyright licence to any author accepted manuscript version arising from this submission.
We also thank kwea123, who open-sourced a multi-GPU implementation of NeRF upon which we build our approach.
If you found this work usefull in your research, consider citing our paper.
@inproceedings{psychogyios2023realistic,
title={Realistic Endoscopic Illumination Modeling for NeRF-Based Data Generation},
author={Psychogyios, Dimitrios and Vasconcelos, Francisco and Stoyanov, Danail},
booktitle={International Conference on Medical Image Computing and Computer-Assisted Intervention},
pages={535--544},
year={2023},
organization={Springer}
}