Closed Zhang-GK closed 3 years ago
Thanks for your attention for our work.
As for the label of relation, you can refer Large-Scale-VRD for details.
As for the label of visual object, you can refer Faster-rcnn for details.
What's more, we utilize the relation embedding to capture the visual relationship, rather than the textual label. Since the relation embedding contains more visual objects information, which can be viewed as a more soft strategy to construct visual relationship.
I get it!Thanks again!
The triplet is expressed as embeddings. Is there any way to get the textual form? Thanks in advance.