-
Complete error content:
![1d6c362c-c0b0-4a53-98a0-841d9762393c](https://github.com/user-attachments/assets/1bf78f47-c161-48a7-ae55-2db6507f8d4c)
Related toml files
![f1641c49-d4df-4fbb-805f-f51608b…
-
Sorry to bother you.When i try to python ./scripts/convert_controlnet_to_diffusers.py --checkpoint_path ../models/control_any3_openpose.pth --dump_path control_any3_openpose --device cpu. Something w…
-
**Describe the bug**
When using the Python bindings for Precise, I've noticed that the model predictions can vary substantially depending on where in the input audio the wake word is located. For exa…
-
run python scripts/txt2img.py --prompt "a photograph of an astronaut riding a horse" --plms error:
Can't load the model for 'openai/clip-vit-large-patch14'. If you were trying to load it from 'htt…
-
https://github.com/johko/computer-vision-course/blob/main/chapters/en/unit4/multimodal-models/transfer_learning.mdx
The notebook links in the Transfer Learning Application section don't go anywhere…
-
Does TensorRT support QAT&PTQ INT8 quantization of clip/vit models? Could you please provide any relevant quantization examples and accuracy & latency benchmark?
shhn1 updated
10 months ago
-
hello, thank you for your great work.
Seems like the proj in VDRImageEncoder isn't being updated? I don't think I saw anywhere where proj would be updated. And I also didn't see where embed(train=Tru…
-
Hi,
Thank you for your great work!
I've been trying to use the Phi-3-Instruct-4B VLM models, but encountered several issues:
- Incorrect LLM backbone choice in phi.py:
https://github.com/R…
-
-
Problems: Use demo to test action classification on kinetics-700 validation set but get very poor result
Experiment:
1. Pretrained model: https://huggingface.co/OpenGVLab/InternVideo2-Stage2_1B-2…