happyharrycn / actionformer_release

Code release for ActionFormer (ECCV 2022)
MIT License
415 stars 77 forks source link

Regarding Epic-Kitchen Combination of Noun and Verb #97

Closed HYUNJS closed 1 year ago

HYUNJS commented 1 year ago

Thank you for sharing your wonderful work! I would like to ask for the details of how did you combine noun and verb action proposals to achieve the reported performance of the action class in Epic-Kitchen. I checked that there were two issues already asked the same question, but I would appreciate it if you could share the exact implementation or the details to reproduce.

https://github.com/happyharrycn/actionformer_release/issues/7 https://github.com/happyharrycn/actionformer_release/issues/29#issuecomment-1134193785

For this part, you can have various choices to fuse these results. For example, you can get the noun/verb predictions for the same point, then take the noun or verb segment predictions as the final segment for this point, or you can simply take the average of these two segments. You may need to modify the code a little bit. We may update this part shortly.

happyharrycn commented 1 year ago

In our previous entry to the EPIC-Kitchen competition, we trained two separate localization models (one for noun and one for verb), and combined their results at inference time. This is done by the following steps.

HYUNJS commented 1 year ago

Thank you for your reply :)