shvdiwnkozbw / Multi-Source-Sound-Localization

This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.
79 stars 15 forks source link

random ciou calculation #2

Closed realshijy closed 4 years ago

realshijy commented 4 years ago

Hi there,

may i ask have you tried the ciou@0.5 accuracy result when always choosing the whole image region as positive? I found it quite high compared with experiment result. (60.8% for my test data, which is 250 samples randomly chosen from 5k annotated sound flickr dataset in Learning to Localize Sound Source in Visual Scenes CVPR18

realshijy commented 4 years ago

I just set new/main.py function cal_ciou parameter thres as 0 Thank you in advance )

shvdiwnkozbw commented 4 years ago

The accuracy result under threshold 0 is closely related to the sounding area of test data. If the test data contain a large amount of audiovisual pairs where the sounding areas occupy nearly the whole region in the image, then setting thres as 0 will result in very high accuracy. Therefore, sometimes the quantitative result cannot well reflect the model's ability for localization, but the qualitative visualizations make more sense.

realshijy commented 4 years ago

@shvdiwnkozbw thank you