TXH-mercury / VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
https://arxiv.org/abs/2305.18500
MIT License
243 stars, 17 forks

Inference code #7

Open 1980x opened 9 months ago

1980x commented 9 months ago

Hello. Thanks for the awesome work and for sharing the code.

Can you please share the inference/demo code? Thanks.

Ijustakid commented 4 months ago

I need it too.

Thanks.