Closed by jinmingteo 3 years ago
Yeah, this output seems odd to me. TIDE doesn't really work for very small changes, because what affects AP is fairly complicated, but that change seems to have caused a large change in AP.
I guess the intuition that you can pull from this is that the change didn't actually affect any one category of error specifically, and just generally made the network better. If your change didn't target one particular subset of the error categories, then the overall AP is more meaningful.
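A minimal sketch of that kind of comparison: look at the overall AP change first, then at the per-error dAP changes. `compare_tide_runs` is a hypothetical helper, not part of the TIDE API. The two AP values are the ones quoted in this thread, while the dAP numbers are made-up placeholders used only to show the mechanics.

```python
# Hypothetical helper for illustration; not part of the TIDE (tidecv) API.
def compare_tide_runs(run_a, run_b):
    """Return (overall AP change, per-error dAP change), both as run_b - run_a."""
    if run_a["dAP"].keys() != run_b["dAP"].keys():
        raise ValueError("both runs must report the same error types")
    ap_delta = round(run_b["AP"] - run_a["AP"], 2)
    dap_delta = {
        err: round(run_b["dAP"][err] - run_a["dAP"][err], 2)
        for err in run_a["dAP"]
    }
    return ap_delta, dap_delta


# AP values are the ones quoted in this thread. The dAP values are made-up
# placeholders (assumptions for illustration, not real results).
model_1 = {
    "AP": 50.43,
    "dAP": {"Cls": 3.0, "Loc": 6.0, "Both": 1.0, "Dupe": 0.5, "Bkg": 4.0, "Miss": 5.0},
}
model_2 = {
    "AP": 45.71,
    "dAP": {"Cls": 2.0, "Loc": 4.0, "Both": 0.5, "Dupe": 0.25, "Bkg": 3.0, "Miss": 9.0},
}

ap_delta, dap_delta = compare_tide_runs(model_1, model_2)

# First cut: the overall AP change.
print(f"AP change (Model 2 - Model 1): {ap_delta:+.2f}")

# Second cut: per-error dAP changes, largest absolute shift first.
for err, delta in sorted(dap_delta.items(), key=lambda kv: abs(kv[1]), reverse=True):
    print(f"  {err:<5} {delta:+.2f}")
```

As far as I understand TIDE, each dAP is roughly the AP that would be gained if that error type were fixed, so a lower dAP on its own does not mean the model is better overall. That is why it helps to read the per-error changes next to the overall AP change.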
Thanks @dbolya! I'll use overall AP as a first cut, then look at the TIDE main errors.
Hi @dbolya,
I was testing out TIDE with two of my models (with slightly different augmentations between them). The results are:
Model 1
Model 2
I am a little confused that the dAP values (except Miss) for Model 2 (with 45.71 AP) are significantly lower than those for Model 1 (with 50.43 AP). Is there a good intuition or interpretation for these results? I would think Model 1 is better (given its mAP), but TIDE seems to suggest otherwise.