roboflow / inference

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

https://inference.roboflow.com

Other

1.3k stars 116 forks source link

Paligemma infer #436

Closed probicheaux closed 4 months ago

probicheaux commented 4 months ago

Description

Adds Paligemma as an ort route so finetunes can be deployed with inference

BIg change for paligemma, this moves it from a core model to a subclass of RoboflowInferenceModel so the weights now come through /ort/. This pr relies on https://github.com/roboflow/roboflow/pull/3491

Type of change

Please delete options that are not relevant.

[ ] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds functionality)
[ ] This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

Locally

Any specific deployment considerations

Ok, core paligemma is supported under paligemma-3b-mix-224

Docs

[ ] Docs updated? What were the changes: