-
I guess the performance of the original implementation, transformers implementation, and the timm may be different...Do you select the timm version because of that?
See [this issue](https://github.…
-
I am trying to finetune my model for a specific task using my own dataset. I have already format the dataset correctly according to the docs. Here I got weird error of `train.py: error: the following …
-
(main) (base) legion4080@LAPTOP-DTSSV3S1:~/MYXY/AI_pkgs/moondream$ python gradio_demo.py
Using device: cuda
If you run into issues, pass the `--cpu` flag to this script.
Special tokens have been …
-
**Is your feature request related to a problem? Please describe.**
I'm looking into testing SigLIP as an alternative text encoder. Is this supported or even possible to do? My knowledge is limited en…
-
The base `VisionBackbone` defined a forward method that accepts `pixel_values` as a Pytorch tensor:
https://github.com/TRI-ML/prismatic-vlms/blob/main/prismatic/models/backbones/vision/base_vision.…
-
Hi:
I use the LoRA to finetune my own models, but the results drop dramatically compared with the full finetuning conterparts.Here is the result table:
finetune-type | MME | GQA | MMBench | MM-Vet…
-
Is there a way to compare the embedding of the image to embedding of the text and calculate similarity score?
-
Thanks for your work
I would like to know which effect would be better between continuous fine-tuning and fine-tuning multiple instructions at once?
-
-