Depth estimation in colonoscopy images provides geometric clues for downstream medical analysis tasks, such as polyp detection, 3D reconstruction, and diagnosis. Recently, deep learning technology has made significant progress in monocular depth estimation for nat- ural scenes. However, without sufficient ground truth of dense depth maps for colonoscopy images, it is signicantly challenging to train deep neural networks for colonoscopy depth estimation. In this paper, we pro- pose a novel approach that makes full use of both synthetic data and real colonoscopy videos.We use synthetic data with ground truth depth maps to train a depth estimation network with a generative adversarial network model. Despite the lack of ground truth depth, real colonoscopy videos are used to train the network in a self-supervision manner by exploiting temporal consistency between neighboring frames. Furthermore, we design a masked gradient warping loss in order to ensure temporal consis- tency with more reliable correspondences. We conducted both quantita- tive and qualitative analysis on an existing synthetic dataset and a set of real colonoscopy videos, demonstrating the superiority of our method on more accurate and consistent depth estimation for colonoscopy images.
[In MICCAI 2021].
We will clean the code and write more detailed instruction soon.
bash ./scripts/test.sh
):
#!./scripts/test.sh
python syn_test.py --name colon2depth_512p --no_instance --label_nc 0
bash ./scripts/train.sh
):
#!./scripts/train.sh
python train.py --name colon2depth_512p --batchSize 8 --gpu_ids 1,2 --label_nc 0 --no_instance --tf_log --no_vgg_loss --continue_train```
If you find this useful for your research, please use the following.
Kai. Cheng, Yiting. Ma, Yang. Li, Bin. Sun and Xuejin. Chen. "Depth Estimation for Colonoscopy Images with
Self-supervised Learning from Videos", Medical Image Computing and Computer Assisted Intervention Society, 2021
This code borrows from NVIDIA/pix2pixHD.