ChenDelong1999 / RemoteCLIP

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
https://arxiv.org/abs/2306.11029
Apache License 2.0
277 stars · 18 forks

Box to caption #17

Closed (lzwqjh closed this issue 9 months ago)

lzwqjh commented 9 months ago

Hi, I am very interested in your work. We have some remote sensing data that needs to be captioned. Could you please share your box-to-caption code? I would be grateful for a reply.

ChenDelong1999 commented 9 months ago

Unfortunately, we cannot release the code, sorry. However, implementing B2C (box-to-caption) and M2B (mask-to-box) is quite straightforward. For a re-implementation, you can refer to Algorithm 1 in the arXiv v1 version of the paper: https://arxiv.org/pdf/2306.11029v1.pdf
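For readers who want a starting point, here is a rough sketch of what a rule-based M2B and B2C step could look like. This is not the authors' code, and it does not claim to reproduce Algorithm 1. The function names (`mask_to_box`, `box_to_caption`), the caption templates, the `center_fraction` split, the `many_threshold` value, and the naive "add an s" pluralization are all assumptions made for illustration. Check them against Algorithm 1 in the paper before using this on real data.

```python
from collections import Counter


def mask_to_box(mask):
    """M2B sketch: tight (x_min, y_min, x_max, y_max) box around the nonzero
    pixels of a binary mask given as a list of rows. Returns None if empty."""
    xs = [x for row in mask for x, value in enumerate(row) if value]
    ys = [y for y, row in enumerate(mask) if any(row)]
    if not xs:
        return None
    return (min(xs), min(ys), max(xs), max(ys))


def _phrase(label, count, many_threshold):
    # Hypothetical wording rule: exact counts for small groups, "many" otherwise.
    if count == 1:
        return f"one {label}"
    quantity = "many" if count > many_threshold else str(count)
    return f"{quantity} {label}s"  # naive pluralization, an assumption


def _describe(counts, many_threshold):
    # Join per-category phrases into "a, b and c"; counts must be non-empty.
    parts = [_phrase(label, n, many_threshold) for label, n in sorted(counts.items())]
    if len(parts) == 1:
        return parts[0]
    return ", ".join(parts[:-1]) + " and " + parts[-1]


def box_to_caption(boxes, image_size, center_fraction=0.5, many_threshold=10):
    """B2C sketch. boxes: list of (label, x_min, y_min, x_max, y_max);
    image_size: (width, height). Returns a list of caption strings."""
    width, height = image_size
    # The central region is the middle `center_fraction` of each axis (assumption).
    margin_x = width * (1 - center_fraction) / 2
    margin_y = height * (1 - center_fraction) / 2

    central, peripheral = Counter(), Counter()
    for label, x0, y0, x1, y1 in boxes:
        cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
        in_center = (margin_x <= cx <= width - margin_x
                     and margin_y <= cy <= height - margin_y)
        (central if in_center else peripheral)[label] += 1

    captions = []
    if central:
        captions.append(
            f"The center of the image contains {_describe(central, many_threshold)}.")
    if peripheral:
        captions.append(
            "Away from the center, the image contains "
            f"{_describe(peripheral, many_threshold)}.")
    overall = central + peripheral
    if overall:
        captions.append(f"The image contains {_describe(overall, many_threshold)}.")
    return captions
```

For segmentation data, boxes produced by `mask_to_box` (one per object mask, paired with its class label) can be passed straight to `box_to_caption`. Detection data can skip the M2B step and use its annotated boxes directly.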
