zjukg / Structure-CLIP

[Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations
https://arxiv.org/abs/2305.06152
112 stars 3 forks source link

scene graph #20

Closed AirPlanBird closed 1 month ago

AirPlanBird commented 4 months ago

Could you please explain the process or method used to generate the JSON file containing scene graph data? I am particularly interested in understanding the steps involved and any tools or algorithms used for this task

tonydavis629 commented 3 months ago

The scene graph data is generated using SceneGraphparse on COCO image captions