Authors: Deng-Ping Fan, Tao Zhou, Ge-Peng Ji, Yi Zhou, Geng Chen, Huazhu Fu, Jianbing Shen, and Ling Shao.
This repository provides code for "Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images" TMI-2020. (arXiv Pre-print & medrXiv & 中译版)
If you have any questions about our paper, feel free to contact us. And if you are using COVID-SemiSeg Dataset, Inf-Net or evaluation toolbox for your research, please cite this paper (BibTeX).
We elaborately collect COVID-19 imaging-based AI research papers and datasets awesome-list.
gt_path
Table of contents generated with markdown-toc
Figure 1. Example of COVID-19 infected regions in CT axial slice, where the red and green masks denote the
ground-glass opacity (GGO) and consolidation, respectively. The images are collected from [1].
[1] COVID-19 CT segmentation dataset, link: https://medicalsegmentation.com/covid19/, accessed: 2020-04-11.
Preview:
Our proposed methods consist of three individual components under three different settings:
Inf-Net (Supervised learning with segmentation).
Semi-Inf-Net (Semi-supervised learning with doctor label and pseudo label)
Semi-Inf-Net + Multi-Class UNet (Extended to Multi-class Segmentation, including Background, Ground-glass Opacities, and Consolidation).
Dataset Preparation:
Firstly, you should download the testing/training set (Google Drive Link)
and put it into ./Dataset/
repository.
Download the Pretrained Model:
ImageNet Pre-trained Models used in our paper (
VGGNet16,
ResNet, and
Res2Net),
and put them into ./Snapshots/pre_trained/
repository.
Configuring your environment (Prerequisites):
Note that Inf-Net series is only tested on Ubuntu OS 16.04 with the following environments (CUDA-10.0). It may work on other operating systems as well but we do not guarantee that it will.
Creating a virtual environment in terminal: conda create -n SINet python=3.6
.
Installing necessary packages: pip install -r requirements.txt
.
Installing THOP for counting the FLOPs/Params of model.
Figure 2. The architecture of our proposed Inf-Net model, which consists of three reverse attention
(RA) modules connected to the paralleled partial decoder (PPD).
Train
We provide multiple backbone versions (see this line) in the training phase, i.e., ResNet, Res2Net, and VGGNet, but we only provide the Res2Net version in the Semi-Inf-Net. Also, you can try other backbones you prefer to, but the pseudo labels should be RE-GENERATED with corresponding backbone.
Turn off the semi-supervised mode (--is_semi=False
) turn off the flag of whether use pseudo labels (--is_pseudo=False
) in the parser of MyTrain_LungInf.py
and just run it! (see this line)
Test
When training is completed, the weights will be saved in ./Snapshots/save_weights/Inf-Net/
.
You can also directly download the pre-trained weights from Google Drive.
Assign the path --pth_path
of trained weights and --save_path
of results save and in MyTest_LungInf.py
.
Just run it and results will be saved in ./Results/Lung infection segmentation/Inf-Net
Figure 3. Overview of the proposed Semi-supervised Inf-Net framework.
Data Preparation for a pseudo-label generation. (Optional)
Dividing the 1600 unlabeled image into 320 groups (1600/K groups, we set K=5 in our implementation),
in which images with *.jpg
format can be found in ./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/Imgs/
.
(I suppose you have downloaded all the train/test images following the instructions above)
Then you only just run the code stored in ./SrcCode/utils/split_1600.py
to split it into multiple sub-dataset,
which are used in the training process of pseudo-label generation. The 1600/K sub-datasets will be saved in
./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/DataPrepare/Imgs_split/
You can also skip this process and download them from Google Drive that is used in our implementation.
Generating Pseudo Labels (Optional)
After preparing all the data, just run PseudoGenerator.py
. It may take at least day and a half to finish the whole generation.
You can also skip this process and download intermediate generated file from Google Drive that is used in our implementation.
When training is completed, the images with pseudo labels will be saved in ./Dataset/TrainingSet/LungInfection-Train/Pseudo-label/
.
Train
Firstly, turn off the semi-supervised mode (--is_semi=False
) and turn on the flag of whether using pseudo labels
(--is_pseudo=True
) in the parser of MyTrain_LungInf.py
and modify the path of training data to the pseudo-label
repository (--train_path='Dataset/TrainingSet/LungInfection-Train/Pseudo-label'
). Just run it!
When training is completed, the weights (trained on pseudo-label) will be saved in ./Snapshots/save_weights/Inf-Net_Pseduo/Inf-Net_pseudo_100.pth
. Also, you can directly download the pre-trained weights from Google Drive. Now we have prepared the weights that is pre-trained on 1600 images with pseudo labels. Please note that these valuable images/labels can promote the performance and the stability of the training process, because of ImageNet pre-trained models are just designed for general object classification/detection/segmentation tasks initially.
Secondly, turn on the semi-supervised mode (--is_semi=True
) and turn off the flag of whether using pseudo labels
(--is_pseudo=False
) in the parser of MyTrain_LungInf.py
and modify the path of training data to the doctor-label (50 images)
repository (--train_path='Dataset/TrainingSet/LungInfection-Train/Doctor-label'
). Just run it.
Test
When training is completed, the weights will be saved in ./Snapshots/save_weights/Semi-Inf-Net/
.
You also can directly download the pre-trained weights from Google Drive.
Assign the path --pth_path
of trained weights and --save_path
of results save and in MyTest_LungInf.py
.
Just run it! And results will be saved in ./Results/Lung infection segmentation/Semi-Inf-Net
.
Here, we provide a general and simple framework to address the multi-class segmentation problem. We modify the original design of UNet that is used for binary segmentation, and thus, we name it as Multi-class UNet. More details can be found in our paper.
Figure 3. Overview of the proposed Semi-supervised Inf-Net framework.
Train
Just run MyTrain_MulClsLungInf_UNet.py
Note that ./Dataset/TrainingSet/MultiClassInfection-Train/Prior
is just borrowed from ./Dataset/TestingSet/LungInfection-Test/GT/
,
and thus, two repositories are equally.
Test
When training is completed, the weights will be saved in ./Snapshots/save_weights/Semi-Inf-Net_UNet/
. Also, you can directly download the pre-trained weights from Google Drive.
Assigning the path of weights in parameters snapshot_dir
and run MyTest_MulClsLungInf_UNet.py
. All the predictions will be saved in ./Results/Multi-class lung infection segmentation/Consolidation
and ./Results/Multi-class lung infection segmentation/Ground-glass opacities
.
We provide a one-key evaluation toolbox for LungInfection Segmentation tasks, including Lung-Infection and Multi-Class-Infection. Please download the evaluation toolbox Google Drive.
Prerequisites: MATLAB Software (Windows/Linux OS is both works, however, we suggest you test it in the Linux OS for convenience.)
run cd ./Evaluation/
and matlab
open the Matlab software via terminal
Just run main.m
to get the overall evaluation results.
Edit the parameters in the main.m
to evaluate your custom methods. Please refer to the instructions in the main.m
.
We also build a semi-supervised COVID-19 infection segmentation (COVID-SemiSeg) dataset, with 100 labelled CT scans from the COVID-19 CT Segmentation dataset [1] and 1600 unlabeled images from the COVID-19 CT Collection dataset [2]. Our COVID-SemiSeg Dataset can be downloaded at Google Drive.
[1]“COVID-19 CT segmentation dataset,” https://medicalsegmentation.com/covid19/, accessed: 2020-04-11. [2]J. P. Cohen, P. Morrison, and L. Dao, “COVID-19 image data collection,” arXiv, 2020.
Lung infection which consists of 50 labels by doctors (Doctor-label) and 1600 pseudo labels generated (Pseudo-label) by our Semi-Inf-Net model. Download Link.
Multi-Class lung infection which also composed of 50 multi-class labels (GT) by doctors and 50 lung infection labels (Prior) generated by our Semi-Inf-Net model. Download Link.
The Lung infection segmentation set contains 48 images associated with 48 GT. Download Link.
The Multi-Class lung infection segmentation set has 48 images and 48 GT. Download Link.
The download link (Google Drive) of our 638-dataset, which is used in Table.V of our paper.
== Note that ==: In our manuscript, we said that the total testing images are 50. However, we found there are two images with very small resolution and black ground-truth. Thus, we discard these two images in our testing set. The above link only contains 48 testing images.
To compare the infection regions segmentation performance, we consider the two state-of-the-art models U-Net and U-Net++. We also show the multi-class infection labeling results in Fig. 5. As can be observed, our model, Semi-Inf-Net & FCN8s, consistently performs the best among all methods. It is worth noting that both GGO and consolidation infections are accurately segmented by Semi-Inf-Net & FCN8s, which further demonstrates the advantage of our model. In contrast, the baseline methods, DeepLabV3+ with different strides and FCNs, all obtain unsatisfactory results, where neither GGO nor consolidation infections can be accurately segmented.
Lung infection segmentation results can be downloaded from this link
Multi-class lung infection segmentation can be downloaded from this link
Figure 4. Visual comparison of lung infection segmentation results.
Figure 5. Visual comparison of multi-class lung infection segmentation results, where the red and green labels
indicate the GGO and consolidation, respectively.
Ori GitHub Link: https://github.com/HzFu/COVID19_imaging_AI_paper_list
Figure 6. This is a collection of COVID-19 imaging-based AI research papers and datasets.
https://arxiv.org/pdf/2004.14133.pdf
Please cite our paper if you find the work useful:
@article{fan2020infnet,
author={Fan, Deng-Ping and Zhou, Tao and Ji, Ge-Peng and Zhou, Yi and Chen, Geng and Fu, Huazhu and Shen, Jianbing and Shao, Ling},
journal={IEEE Transactions on Medical Imaging},
title={Inf-Net: Automatic COVID-19 Lung Infection Segmentation From CT Images},
year={2020},
volume={39},
number={8},
pages={2626-2637},
doi={10.1109/TMI.2020.2996645}
}
The COVID-SemiSeg Dataset is made available for non-commercial purposes only. Any comercial use should get formal permission first.
You will not, directly or indirectly, reproduce, use, or convey the COVID-SemiSeg Dataset or any Content, or any work product or data derived therefrom, for commercial purposes.
We would like to thank the whole organizing committee for considering the publication of our paper in this special issue (Special Issue on Imaging-Based Diagnosis of COVID-19) of IEEE Transactions on Medical Imaging. For more papers refer to Link.
If you want to improve the usability of code or any other pieces of advice, please feel free to contact me directly (E-mail).
[ ] Support NVIDIA APEX
training.
[ ] Support different backbones ( VGGNet (done), ResNet, ResNeXt Res2Net (done), iResNet, and ResNeSt etc.)
[ ] Support distributed training.
[ ] Support lightweight architecture and faster inference, like MobileNet, SqueezeNet.
[ ] Support distributed training
[ ] Add more comprehensive competitors.
If the image cannot be loaded on the page (mostly in domestic network situations).
I tested the U-Net, however, the Dice score is different from the score in TABLE II (Page 8 of our manuscript).
Note that, our Dice score is the mean dice score rather than the max Dice score. You can use our evaluation toolbox Google Drive.
The training set of each compared model (e.g., U-Net, Attention-UNet, Gated-UNet, Dense-UNet, U-Net++, Inf-Net (ours)) is 48 images rather than 48 images + 1600 images.