jhc13 / taggui

Tag manager and captioner for image datasets
GNU General Public License v3.0
666 stars 31 forks source link

Model Request: Moondream #54

Closed Goldenkoron closed 6 months ago

Goldenkoron commented 7 months ago

Please add support for this model. https://github.com/vikhyat/moondream

An extra idea which may be feasible or unfeasible (I do not know) is maybe speculative decoding using a smaller model like this. https://arxiv.org/abs/2310.07177

My experience with speculative decoding in LLMs at least is that it greatly speeds up inference time, and perhaps doing the same thing with like cogvlm as a main model and moondream as a speculative decoding model could speed up captioning of large datasets.

jhc13 commented 6 months ago

The model has been added in v1.17.0.

An extra idea which may be feasible or unfeasible (I do not know) is maybe speculative decoding using a smaller model like this.

This is difficult to implement and will not be added at this time.