roboflow / inference

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
https://inference.roboflow.com

Paligemma infer #436

Closed probicheaux closed 4 months ago

probicheaux commented 4 months ago

Description

Adds PaliGemma as an ort route so fine-tunes can be deployed with inference.

Big change for PaliGemma: this moves it from a core model to a subclass of RoboflowInferenceModel, so the weights now come through /ort/. This PR relies on https://github.com/roboflow/roboflow/pull/3491
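
As a rough illustration of what this move means, here is a minimal sketch using stand-in classes. None of these names are the real inference API: RoboflowInferenceModelSketch, PaliGemmaSketch, the api.example.com base URL and the "my-project/3" fine-tune ID are all hypothetical, and only the "ort" route name is taken from the description above. The idea shown is that once the model subclasses a base class that resolves weights by model ID, a fine-tune gets its weights through the same route as the base model.

```python
# Illustrative stand-ins only: these classes are NOT the real inference API.


class RoboflowInferenceModelSketch:
    """Hypothetical stand-in for a base class that resolves weights per model ID."""

    # Assumed route name, taken from the "/ort/" mentioned in the PR description.
    weights_route = "ort"

    def __init__(self, model_id: str, api_base: str = "https://api.example.com"):
        self.model_id = model_id
        self.api_base = api_base.rstrip("/")

    def weights_url(self) -> str:
        # Weights are looked up by model ID, so each fine-tune gets its own URL.
        return f"{self.api_base}/{self.weights_route}/{self.model_id}"


class PaliGemmaSketch(RoboflowInferenceModelSketch):
    """Hypothetical subclass: it only inherits the weight resolution above."""


base = PaliGemmaSketch("paligemma-3b-mix-224")
finetune = PaliGemmaSketch("my-project/3")  # hypothetical fine-tune ID

print(base.weights_url())
print(finetune.weights_url())
```

In this sketch the subclass adds no loading logic of its own, which is the point: weight resolution lives in the base class, so the base model and a fine-tune differ only in the model ID passed in.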

Type of change

Please delete options that are not relevant.

How has this change been tested, please provide a testcase or example of how you tested the change?

Locally

Any specific deployment considerations

Core PaliGemma is supported under paligemma-3b-mix-224.

Docs