"To label a clip, we consider all sound events within the 1s window. If an event overlaps with the window for more than 0.5s or half of the event duration, we add the corresponding class into the clip label. We then consider the number of classes within a clip as the
level of polyphony with the assumption that it is rare to have short non-overlapping events within a 1s window. "
"To label a clip, we consider all sound events within the 1s window. If an event overlaps with the window for more than 0.5s or half of the event duration, we add the corresponding class into the clip label. We then consider the number of classes within a clip as the level of polyphony with the assumption that it is rare to have short non-overlapping events within a 1s window. "
Can you provide a demo? Thank you very much!