Closed anjugopinath closed 2 years ago
I think yes.
we have illustrated the feature representation via t-SNE in Figure 10 and Figure 11 which include the features of each object under the same verb category (the same edges in each cluster in Figure 10 (b) and Figure 11(c)). However, we do not illustrate the same object (bottle) with different verbs. But we think it is also possible.
It might be easy to achieve this according to "extract_HO_feature.py", feats_tsne.py, extract_HO_feature_fake.py, feats_tsne_fake.py. feats_tsne_fake.py is the code for illustrating object features. I guess it is easy to change to illustrate verb features according to object category.
Regards
emm, I guess misunderstand your points. Do you mean select out the images according to the object category and verb category?
You can select it according to the annotations. https://drive.google.com/file/d/1M4j5-rHcdfHYVfHQToccO0SsEGP4nGC1/view?usp=sharing. This include verb id and object id for each annotation (box pair).
Thank You so much for your reply.
This is one of the entries: {"image_id": 408, "category_id": 1, "bbox": [216.0, 46.0, 350.0, 354.0], "iscrowd": 0, "id": 1157}
Could you tell me what "category_id" and "id" stands for please? Similarly, what does "iscrowd" stand for?
Does this contain entries for both HICO-DET and HOI-COCO dataset? Do you have a script to download specific images? My end goal is to download all images containing a certain object - for example, download all images associated with object name 'cup' , 'donut' etc.
Thank You in advance!
Also, is it possible to download training images by object category?
Sorry for confusing you. For HICO-DET, the provided file is wrong. Here (https://drive.google.com/drive/folders/1vIDhLwpVSTQ1aPNQh_Qz4xXU8D98TYRD) provides the annotation for verb category and object category.
For example, in this item,
{"file_name": "HICO_train2015_00000001.jpg", "img_id": 1, "annotations": [{"bbox": [207, 32, 426, 299], "category_id": 1}, {"bbox": [58, 97, 571, 404], "category_id": 4}], "hoi_annotation": [{"subject_id": 0, "object_id": 1, "category_id": 73, "hoi_category_id": 153}, {"subject_id": 0, "object_id": 1, "category_id": 77, "hoi_category_id": 154}, {"subject_id": 0, "object_id": 1, "category_id": 88, "hoi_category_id": 155}]}
the items in annotations represent the object category. The items in hoi_annotation mean the hoi category (subject (that's fixed to person), object, category_id( verb id)). In hoi category subject_id and object_id is the index of corresponding object in hoi_annotation.
For the images of HICO-DET, you can download from https://drive.google.com/open?id=1QZcJmGVlF9f4h-XLWe9Gkmnmj2z1gSnk.
For V-COCO, you can convert the annotation according to https://github.com/hitachi-rd-cv/qpic#v-coco.
Thank You so much. This answers my questions I believe.
I downloaded the HICO-DET dataset. Is it possible to view the images by category. For ex) all images with bottles bottle - drink with bottle - hold bottle - carry
etc ?