First of all, thanks for the great work :)
I am deploying RunPod serverless endpoints using the worker-infinity image and querying them with the OpenAI SDK (with the RunPod API key as `api_key` and the RunPod endpoint as the `base_url`).
However, some of the Hugging Face models require authentication, for instance the Nvidia NV-Embed model.
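A minimal sketch of this kind of client setup, in case it helps with reproducing the issue. The environment variable names `RUNPOD_API_KEY` and `RUNPOD_BASE_URL` and the helper names are placeholders chosen for illustration, not names defined by RunPod or worker-infinity; the base URL is whatever OpenAI-compatible URL your endpoint exposes.

```python
import os


def client_kwargs(api_key: str, base_url: str) -> dict:
    """Collect the two settings the OpenAI client needs to target a custom endpoint."""
    if not api_key or not base_url:
        raise ValueError("both api_key and base_url are required")
    # Strip a trailing slash so the stored base URL has a consistent form.
    return {"api_key": api_key, "base_url": base_url.rstrip("/")}


def embed(texts, model="nvidia/NV-Embed-v1"):
    """Request embeddings for a list of strings from the serverless endpoint."""
    # Imported lazily so client_kwargs() works without the openai package installed.
    from openai import OpenAI

    client = OpenAI(**client_kwargs(
        os.environ["RUNPOD_API_KEY"],   # hypothetical variable name
        os.environ["RUNPOD_BASE_URL"],  # hypothetical variable name
    ))
    response = client.embeddings.create(model=model, input=texts)
    return [item.embedding for item in response.data]
```

Calling `embed(["some text"])` with a gated model such as `nvidia/NV-Embed-v1` is the kind of request that leads to the error in the pod log.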
I can see the following from the pod log:
Cannot access gated repo for url https://huggingface.co/nvidia/NV-Embed-v1/resolve/main/config.json. Access to model nvidia/NV-Embed-v1 is restricted. You must be authenticated to access it.
Is there any supported way to authenticate with Hugging Face from the worker so that gated models like this one can be loaded?