deeplearning-wisc / stud

source code for CVPR'22 paper "Unknown-Aware Object Detection: Learning What You Don’t Know from Videos in the Wild"
Apache License 2.0
117 stars 14 forks source link

issues about visualization #4

Closed captain0305 closed 2 years ago

captain0305 commented 2 years ago

Hi!

I have some problem with visualization using model model_final_resnet_bdd.

using code

image

Results are showing the det boxes on the img of all id classes. But there's no ood boxes showing on the img. How could I tell apart the id and old object using this repo?

Looking forward to your reply!

captain0305 commented 2 years ago

I have solve this issue with this code

but I still have some question with this issue.

inter_feat = results.inter_feat[0:max_boxes]
    print(inter_feat)
    print(torch.logsumexp(inter_feat[:, :-1], dim=1).cpu().data.numpy())
    print((np.argwhere(
            torch.logsumexp(inter_feat[:, :-1], dim=1).cpu().data.numpy() < energy_threshold)).reshape(-1))
    if energy_threshold:
        labels[(np.argwhere(
            torch.logsumexp(inter_feat[:, :-1], dim=1).cpu().data.numpy() < energy_threshold)).reshape(-1)] = 8

What is the inter_feat? It has 11 dimension. And after calculating , it is being used to tell apart the ood and id obj. Is this just match the step in the paper below?

image

And I wonder why inter_feat has 11 dimension and the meaning of each dimension. And also why the pred_cls_probs has 11 dimension? I just using the model model_final_resnet_bdd, and it's config just have 8 categories. I just quite confused about that.

Looking forward to your reply

d12306 commented 2 years ago

hi, @captain0305 , sorry for the late reply! i am really busy recently with experiments.

1) yes, the inter_feat here is the logits.

2) actually bdd100k has 10 classes plus a background class. here we use the 11-bit logit for training but essentially we only train with 8 classes. this is the legacy code used in https://github.com/xinw1012/cycle-confusion. you can definitely use 8-bit. i suppose the result to be similar.

d12306 commented 2 years ago

@captain0305 , feel free to post more questions here if needed! i will check more frequently from now on

captain0305 commented 2 years ago

Thank u for your reply! I have trained my data using stud, and have some benchmarks.

BigBuffa1o commented 1 year ago

@captain0305 Hello,by applying your code on a single test img,should I modify the original yaml file to get a successful output like you?