Image-to-text generation

wisdomikezogwo / quilt1m

[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.

MIT License

138 stars, 8 forks

Hi,

To use QuiltNet for retrieval, I'd suggest you use https://github.com/LAION-AI/CLIP_benchmark, which we also leveraged in the evaluation.
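For a sense of what a retrieval evaluation measures, here is a minimal sketch of image-to-text retrieval scored with recall@k. It is not the CLIP_benchmark code and it does not load QuiltNet: the embeddings are random placeholders standing in for the outputs of the image and text towers, and the helper names (`l2_normalize`, `rank_texts`, `recall_at_k`) are hypothetical. It also assumes that image `i` is paired with text `i`.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    norms = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norms, eps)


def rank_texts(image_emb, text_emb):
    """For each image, return text indices sorted from most to least similar."""
    sims = l2_normalize(image_emb) @ l2_normalize(text_emb).T
    return np.argsort(-sims, axis=1)


def recall_at_k(ranking, k):
    """Fraction of images whose paired text (same index) is in the top k."""
    targets = np.arange(ranking.shape[0])[:, None]
    hits = (ranking[:, :k] == targets).any(axis=1)
    return float(hits.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Placeholder embeddings: in practice these would come from the model.
    text_emb = rng.normal(size=(8, 16))
    image_emb = text_emb + 0.1 * rng.normal(size=(8, 16))
    ranking = rank_texts(image_emb, text_emb)
    for k in (1, 5):
        print(f"recall@{k}: {recall_at_k(ranking, k):.2f}")
```

Swapping the placeholder arrays for real image and caption embeddings gives the same kind of number a retrieval benchmark reports, though the exact protocol there may differ.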

Also, for text-generation tasks, we recently released a new work called Quilt-LLAVA, in which we use the image tower of QuiltNet to train the model. Fortunately, the model should be available either tonight or tomorrow night. With Quilt-LLAVA, you could conduct research with an LMM tuned for histopathology. Please read the paper when you have some time.


Image-to-text generation #17