Closed idejie closed 2 years ago
Hello @idejie, thank you for reaching out!
positive_clips
refers to the clips that contain a state change. Similarly, negative_clips
means the clips that do not have a state change. Further, @fuqichen1998 will be able to give you more idea about pre_pnr_post_frames
.
I would suggest you to please have a look at this (https://github.com/EGO4D/hands-and-objects/blob/main/state-change-localization-classification/i3d-resnet50/configs/defaults.py) file for a detailed description of various configuration terms.
Feel free to reach out to us if there is any doubt!
Regards, Siddhant Bansal
Thanks a lot for your reply, @Sid2697 ! The repo is quite awesome! @fuqichen1998 !
I will carefully study the codes, and thanks for your advice!
And I still have some questions:
Thanks again!
Thank you @idejie!
Here are answers to your questions:
Regards, Siddhant Bansal
Thanks for your answers @Sid2697 ! And sorry for the unclear expression for Q2.
For these 8s clips, I know negative samples don't contain any state change frame. The PNR localization is to find the beginning frame of an object state change in positive samples, and the state change may be related to an action. It's like the task of temporal action localization, and one baseline in this repo is based on a temporal action localization method (boundary matching network).
For the temporal action localization task, a clip may contain several actions, and some methods can find their boundaries(start and end of an action). But for the PNR localization, does a positive sample contain other actions whose beginnings are not PNR?
Best Wishes, Dejie Yang
Thank you @idejie for detailed description of the question!
For the PNR localisation task, a positive sample contains only one action. The data selection and annotation have been carried out to have one action (and the corresponding PNR frame) per clip.
Let me know if there is any confusion!
Regards, Siddhant Bansal
Thanks for your answer @Sid2697 !
I think you have solved my questions!
Best Wishes, Dejie Yang
Hi, there are many configurations, maybe under the preprocessed data, like
positive_clips
,pre_pnr_post_frames
? How can I process the data to get these splits? Is there a more detailed document?