How to inference on my own image and text？

henghuiding / Vision-Language-Transformer

[ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation

MIT License

338 stars 21 forks source link

Open kelisiya opened 2 years ago

changliu19 commented 2 years ago

Hi,

Please see #3 and the input format in