πππOur paper has been accepted at NeurIPS 2024 as a spotlight!
The SegVol is a universal and interactive model for volumetric medical image segmentation. SegVol accepts point, box and text prompt while output volumetric segmentation. By training on 90k unlabeled Computed Tomography (CT) volumes and 6k labeled CTs, this foundation model supports the segmentation of over 200 anatomical categories.
We have released SegVol's inference code, training code, model params and ViT pre-training params (pre-training is performed over 2,000 epochs on 96k CTs).
Keywords: 3D medical SAM, volumetric image segmentation
The pytorch v1.11.0 (or a higher version) is needed first. Following install key requirements using commands:
pip install 'monai[all]==0.9.0'
pip install einops==0.6.1
pip install transformers==4.18.0
pip install matplotlib
How to use our pre-trained ViT as your model encoder.
πThe 25 processed datasets are being uploaded to ModelScope/ιζη€ΎεΊ and HuggingFace.
https://github.com/BAAI-DCAI/SegVol/assets/60123629/242a1578-e418-463c-9d53-a62eeb154c7d
We performed an external validation experiment using a novel annotated dataset from the ULS23 Challenge (750 + 744 + 124 cases about lesions) and the validation dataset from Amos22 (120 cases about organs). SegVol showed strong segmentation abilities compared to other medical SAM methods in accurately segmenting lesions and 15 important organs.
(2024.01.03) A radar map about zero-shot experiment has been reported. π
(2023.12.25) Our web tool supports download results now! You can use it as an online tool. π₯π₯π₯
(2023.12.15) The training code has been uploaded!
(2023.12.04) A web tool of SegVol is here! Just enjoy it! π₯π₯π₯
(2023.11.28) Our model and demo case have been open-source at huggingface/BAAI/SegVol. π€π€
(2023.11.28) The usage of pre-trained ViT has been uploaded.
(2023.11.24) You can download weight files of SegVol and ViT(CTs pre-train) from huggingface/BAAI/SegVol or Google Drive. π₯π₯π₯
(2023.11.23) The brief introduction and instruction have been uploaded.
(2023.11.23) The inference demo code has been uploaded.
(2023.11.22) The first edition of our paper has been uploaded to arXiv. π
If you find this repository helpful, please consider citing:
@article{du2023segvol,
title={SegVol: Universal and Interactive Volumetric Medical Image Segmentation},
author={Du, Yuxin and Bai, Fan and Huang, Tiejun and Zhao, Bo},
journal={arXiv preprint arXiv:2311.13385},
year={2023}
}
@misc{bai2024m3dadvancing3dmedical,
title={M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models},
author={Fan Bai and Yuxin Du and Tiejun Huang and Max Q. -H. Meng and Bo Zhao},
year={2024},
eprint={2404.00578},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2404.00578},
}
Thanks for the following amazing works:
CLIP.
Image by brgfx on Freepik.
Image by muammark on Freepik.