I really appreciate your wonderful work and nice idea!
I'm now faced with some problems when trying to extract ego4d (clip - text) pairs data.
The narration.json only has "timestamp_sec", "timestamp_frame" of a specific clip without "start_time" and "end_time", I wonder how do you decide the interval of the clips?
My method is reranking the clips narration by "timestamp_sec" , and decide the interval of the i-th clip is just [i-th timestamp_sec,i+1-th timestamp_sec]. Is it correct?Does anyone know about it?
Thanks.
I really appreciate your wonderful work and nice idea! I'm now faced with some problems when trying to extract ego4d (clip - text) pairs data. The narration.json only has "timestamp_sec", "timestamp_frame" of a specific clip without "start_time" and "end_time", I wonder how do you decide the interval of the clips? My method is reranking the clips narration by "timestamp_sec" , and decide the interval of the i-th clip is just [i-th timestamp_sec,i+1-th timestamp_sec]. Is it correct?Does anyone know about it? Thanks.