sStonemason / RET-CLIP

RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports
21 stars 2 forks source link
foundation-models fundus-image-analysis medical-imaging vision-language-model

RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports

Accepted by MICCAI2024.

This code repository is based on the official Chinese-CLIP(LINK)

| Paper |

News

Environments

To start with this project, make sure that your environment meets the requirements below:

python >= 3.6.4 pytorch >= 1.8.0 (with torchvision >= 0.9.0) CUDA Version >= 10.2

Run the following command to install required packages.

pip install -r requirements.txt

Pretrained Model

If you encounter any issue while downloading or using the pretrained model, please feel free to contact us.

Vision Backbone Text Backbone
ViT-b-16 RoBERTa-wwm-ext-base-chinese LINK