beeldengeluid / dane-video-segmentation-worker

0 stars 0 forks source link

Manage edge-cases for spectograms (out of scope) #12

Closed Veldhoen closed 1 year ago

Veldhoen commented 1 year ago

When a keyframe is very close to the beginning or end of a video, a (symmetrical) one second window of audio cannot be creared. Moreover, when a keyframe is close to a shot boundary, a one second window may be inappropriate. However, the feature extraction model requires homogeneous, one second-based spectograms.

We discussed several solutions:

The second approach (apply padding, in a circular fashion) is deemed most appropriate. However, due to time constraints we stick with the last approach (discarding edge frames) at least for the video boundaries, as a minimum effort solution.

NB: the same holds for the annotations a researcher uses as query for similarity search!