zhuoxiao-chen / ReDB-DA-3Ddet

[ICCV 2023] Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and Class-balanced Pseudo-Labeling
Apache License 2.0
46 stars 8 forks source link

Confusion about details of downsampling #2

Closed HuizaiVictorYao closed 1 year ago

HuizaiVictorYao commented 1 year ago

Hi, your excellent work has brought me great inspiration. After looking through your paper, I wonder how you perform downsampling in stage 2 in detail. I mean that if we want to "uniformly downsample" a set of pseudo labels to obtain a subset with a length of \hat{B}/t, we can just randomly choose \hat{B}/t box predictions from the original box prediction set, then what does inverse KDE do in this process? Or “uniform downsampling" is just a proper noun of 3d detection?

Thanks for your reply in advance.

zhuoxiao-chen commented 1 year ago

Hi,

Appreciate your time in reading our work.

We are randomly choosing box predictions but guided by the probability estimated by KDE. Objects with higher OBCs values are assigned higher probability to be chosen.

I will delve into OBC-based downsampling from two perspectives:

  1. Why? Higher OBC scores signify the rarity, diversity, and uniqueness of predicted objects. In reality, most objects are common and occur frequently, thus producing lower OBC scores. We aim for the detector to learn from a varied set of pseudo-labeled objects, not biased to the majority common objects with low OBCs. Hence, we require pseudo-labels with uniformly distributed OBC scores, leading us to uniformly downsample based on OBC scores across all box predictions.
  2. How? Initially, we estimate the OBC score distribution of all box predictions via Kernel Density Estimation (KDE). The density of low OBC scores is notably high (due to large number of common objects), while that of high OBC scores is low. By inverting this distribution, the density of high OBC scores becomes high. This altered distribution allows us to assign higher probabilities to pseudo-labeled objects with high OBCs. Small number of high OBC objects are given high probability to be sampled, while the large number of low OBC objects are given low probability to be sampled. As a result, the downsampled set contains pseudo-labeled objects with more evenly distributed OBC scores.
image

I hope this addresses your question.

Regards, Zhuoxiao (Ivan)

HuizaiVictorYao commented 1 year ago

I got it. Thanks for your prompt reply!