Closed dongfeicui closed 5 months ago
You can change what encoders you want to use here. The default code will download image, video, depth and thermal encoder. There are some instructions about how to use different branch of LanguageBind if you only use a single modality.
Why do I download weights again every time I run inference.py ?