beichenzbc / Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"
Apache License 2.0
620 stars 30 forks source link

Question about training data? #2

Closed BIGBALLON closed 6 months ago

BIGBALLON commented 6 months ago

Hi, @beichenzbc Thank you for your amazing work on Long-CLIP, which has greatly inspired me to improve CLIP's max token limitation and enhance image-retrieval performance.

I have two questions to discuss with you:

  1. According to the paper, long and short captions are needed for training Long-CLIP. However, I noticed that ShareGPT-4V only provides long captions. How can we generate or obtain short captions?
  2. Do you have any plans to release the Long-CLIP training code?

Thank you.