TXH-mercury / VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
https://arxiv.org/abs/2304.08345
MIT License
259 stars, 16 forks

Code to perform QA task #25

Closed by Dewmi24 5 months ago

Dewmi24 commented 7 months ago

Does anybody have inference code or a notebook for running VALOR on the QA task? Failing that, a notebook for information retrieval would help.

TXH-mercury commented 5 months ago

Simple inference code has been added.