LinWeizheDragon / FLMR

The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
71 stars 4 forks source link

Results compare with InternVit #27

Open lucasjinreal opened 3 months ago

lucasjinreal commented 3 months ago

Hi, would love to see some comparsion to the SOTA vit models such as InternVit and Siglip etc, especially for the Chinese version.

LinWeizheDragon commented 3 months ago

Thank you for your suggestions. We will consider adding such baselines to the benchmark when we release the Chinese version.