feature extraction for bounding boxes - a minor confusion

yeliudev / ConsNet

🚴‍♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)

GNU General Public License v3.0

33 stars 2 forks source link

Hi. Thanks for the great documentation on the README.

I went through your code for extracting bounding box features through RoIAlign. But, I am still confused. Lets say we already have bounding boxes in our hand which we got from faster-rcnn. Now, I would like to understand the process for a given single bounding box - what is the sequence of layers a given bounding box goes through inorder to arrive at the final 1024 dimensional feature vector for it? Would you please help me with the sequence of layers responsible for feature extraction process?

yeliudev / ConsNet

feature extraction for bounding boxes - a minor confusion #15