wisdomikezogwo / quilt1m

[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
https://quilt1m.github.io/
MIT License
138 stars 8 forks source link

Can you provide more details of the implementation/training scripts of QuiltNet? #32

Open PixelChen24 opened 2 months ago

PixelChen24 commented 2 months ago

Nice work! And I'd like to do some research based on your code. Can you provide more details about training? For example, the text encoder(tokenizer, max_token_length).

I've set the parameter as you specified in the Tab.8 of your paper. But I get really bad results. I think it maybe due to the text encoder setup, which is default text encoder of CLIP with vocabulary ~48000 and max length 77. However, this cannot suit the need of medical image. So I think it would be better for the community to reproduce your results if you can provide more details about the training process. Thank you very much!