fredzzhang / pvic

[ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"
BSD 3-Clause "New" or "Revised" License
63 stars 8 forks source link

Swin backbone trained weights #56

Open YangJae96 opened 1 week ago

YangJae96 commented 1 week ago

Hi. Thank you for your great work

Is only ResNet-50 backbone available for inference? Could you please provided the Swin-L backbone model weights if possible?!

I would like the evaluate the results and use it for custom data for inference!

Thanks in advance.

fredzzhang commented 1 week ago

Hi @YangJae96,

Unfortunately we did not save the checkpoints with larger backbones due to storage reasons. You can find the fine-tuned object detector with Swin-L here. With this, you should be able to train the HOI detector yourself. Using the --use-checkpoint flag significantly reduces memory requirement.

Let me know if you have trouble reproducing the results.

Cheers, Fred.