Open kaimingkuang opened 1 year ago
Sorry for the late reply.
Actually, the main backbone we use is the vit_b16 visual encoder, which achieves a higher accuracy than rn50 backbone. So you can try change the config file to vit_b16.yaml
.
And thanks for your attention.
Thanks for your amazing work! I was trying to run zero-shot classification of ScanObjectNN using the pipeline you provided. However, I only got 29.78% accuracy for the OBJ_ONLY variation (50.09% reported in the paper) using the following script:
The
test-file
argument specifies which variation to use. Am I missing something here? Thank you!