YueLiao / PPDM

Code for "PPDM: Parallel Point Detection and Matching for Real-time Human-Object Interaction Detection".
MIT License
219 stars 42 forks source link

About the generation of "corre_.npy" #17

Closed BozhouZha closed 4 years ago

BozhouZha commented 4 years ago

Hi YueLiao,

I'm so sorry for bothering you again. I wondered may I have the "corre_vcoco.npy" file if it's at your convenience? I found that just like the hico dataset, the program also relies on the binary mask to do the validation or testing. Annotation alone cannot bring me there. Thank you in advance!

Bozhou

YueLiao commented 4 years ago

You can generate the "npy" by yourself following the below rules: The "corre*.npy" is generated by the "train.json" annotation file. If an interaction is defined between ith class object and jth verb in training set, the value at location (i,j) of "corre_.npy" is 1 else 0.

BozhouZha commented 4 years ago

You can generate the "npy" by yourself following the below rules: The "corre.npy" is generated by the "train.json" annotation file. If an interaction is defined between ith class object and jth verb in training set, the value at location (i,j) of "corre_*.npy" is 1 else 0.

Thanks, I'll try it out. Have a good one~

BozhouZha commented 4 years ago

You can generate the "npy" by yourself following the below rules: The "corre.npy" is generated by the "train.json" annotation file. If an interaction is defined between ith class object and jth verb in training set, the value at location (i,j) of "corre_*.npy" is 1 else 0.

Hi YueLiao,

Thanks again for the way you pointed out yesterday. But I'm still trying to figure the mask out. I'm a newbie at this so this might be a dumb question:

The "i-th" and "j-th" you mentioned is not likely the numerical category label of the object and the verb, right? Take the object for example, the largest categorical index is 90 but the shape of the corre_hico is <117, 80>, which means 80 masks available for object series. It makes sense since some objects are not documented and the faultage exists in those 90 indices.

I guess there might be a list of unordered category ids for the objects and verbs respectively, and the index of the items is the exactly "i" & "j" you mentioned. So I wonder which kind of mapping I'm supposed to be using, or could you please give me some hints how to make the mapping on my own?

I appreciate your help so much!

YueLiao commented 4 years ago

You can map the 90 categories into 80 categories through the label list in dataset/hico.py first, then generating the corre.npy like this

BozhouZha commented 4 years ago

You can map the 90 categories into 80 categories through the label list in dataset/hico.py first, then generating the corre.npy

Get, thanks bro!