clin1223 / VLDet

[ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)
Other
177 stars 11 forks source link

Inference code #4

Open ash80 opened 1 year ago

ash80 commented 1 year ago

Does VLDet support a simple script where I supply an image and a vocabulary file (and possibly embeddings) and get the bounding boxes or segmentation mask as the output?

clin1223 commented 1 year ago

Hi,

Thanks for your interest. The code now is not supporting it. We have a plan to release such a script later. Or. you can just generate the custom vocabulary with RegionCLIP tools and inplace the text embedding.

Thanks!

--Chuang

quantingxie commented 1 year ago

Hi,

Do you have this simple script now such that we could test the with our own images?