Description
I was trying to host https://huggingface.co/microsoft/Florence-2-base-ft using Triton's Python-based vLLM backend and encounter an error (Unrecognized configuration class <class 'transformers_modules.microsoft.Florence-2-base.ee1f1f163f352801f3b7af6b2b96e4baaa6ff2ff.configuration_florence2.Florence2Config'> to build an AutoTokenizer.)
Triton Information
Images: nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3
I pulled the images from the Nvidia image repository and use it as it is
To Reproduce
Set up a model repo directory: ~/work/model_repository/florence-2-base-ft/1
Downloaded model.json and config.pbtxt as suggested in the tutorial
Description I was trying to host https://huggingface.co/microsoft/Florence-2-base-ft using Triton's Python-based vLLM backend and encounter an error (Unrecognized configuration class <class 'transformers_modules.microsoft.Florence-2-base.ee1f1f163f352801f3b7af6b2b96e4baaa6ff2ff.configuration_florence2.Florence2Config'> to build an AutoTokenizer.)
Triton Information Images: nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3 I pulled the images from the Nvidia image repository and use it as it is
To Reproduce
instance_group [ { count: 1 kind: KIND_MODEL } ]