smujiang / WSIPenMarkingRemoval

Remove pen-marking annotations on whole slide images(WSIs) for data achiving.
MIT License
16 stars 1 forks source link
deep-learning digital-pathology gan image-inpainting image-normalization image-restoration

Deep learning models are showing promise in digital pathology to aid diagnoses. Training complex models require a significant amount and diversity of well-annotated data, typically housed in institutional archives. These slides often contain clinically meaningful markings to indicate regions of interest. If slides are scanned with the ink present, then the downstream model may end up looking for regions with ink before making a classification. If scanned without the markings, the information is lost about where the relevant regions are located.

In this repo, we proposed a straightforward framework to digitally remove ink markings from whole slide images using a conditional generative adversarial network, opening the possibility of using archived clinical samples as resources to fuel the next generation of deep learning models for digital pathology.
Please read our paper to get more details. If you find this repo helps you, please cite our work:

Jiang, Jun, Naresh Prodduturi, David Chen, Qiangqiang Gu, Thomas Flotte, Qianjin Feng, and Steven Hart. "Image-to-image translation for automatic ink removal in whole slide images." Journal of Medical Imaging 7, no. 5 (2020): 057502.

Here is an example of WSI showing the clean scan, marked slides and image after restoration.
Thumbnail level Here are some high resolution image patches to show performance of image restoration . High resolution patches In order to evaluate the effectiveness of our workflow, violin plots were drawn to enrich our assessment. We can observe: First, the distributions of all evaluation metrics for tissue_only patches are symmetrical, which indicates that our model preserves the image content if no ink presents. Second, for patches with ink (inked_tissue and ink_only), all three evaluation metrics move to higher values, which indicates ink was effectively removed. Particularly, for image patches with ink, SSIM distributions were dispersive before ink removal (which may be caused by uneven thickness of ink), but aggregate to higher values, which indicates inked image patches were effectively restored. Quantitative evaluation result

Installation

Run our workflow

References

[1] S. Ali, N. K. Alham, C. Verrill, and J. Rittscher, "Ink removal from histopathology whole slide images by combining classification, detection and image generation models," arXiv preprint arXiv:1905.04385, 2019.

[2] Venkatesh, Bairavi, Tosha Shah, Antong Chen, and Soheil Ghafurian. "Restoration of marker occluded hematoxylin and eosin stained whole slide histology images using generative adversarial networks." arXiv preprint arXiv:1910.06428 (2019).