Closed by BlackPuuuudding 6 months ago
No. For InBedding, please refer to text_grounding_net.py.
Then what is hico_det_clip? As far as I know, it comes from GLIGEN (ckpt), but the original GLIGEN does not provide the HICO-DET tsv. Could you share it with us?
Is the embedding of the hico_det_clip dataset obtained through InBedding as described in the paper?