huggingface / Google-Cloud-Containers

Hugging Face Deep Learning Containers (DLCs) for Google Cloud
https://hf.co/docs/google-cloud
Apache License 2.0
131 stars 18 forks source link

Fix `entrypoint.sh` and `huggingface-inference-toolkit` version in PyTorch Inference DLCs #120

Closed alvarobartt closed 1 week ago

alvarobartt commented 1 week ago

Description

This PR fixes the version of huggingface-inference-toolkit pinned within both DLCs:

As the huggingface-inference-toolkit version was pinned to a more recent one (0.4.2) instead of 0.4.1, that's due to the version pinning on huggingface-inference-toolkit that happened before releasing it publicly.

Additionally, this PR also adds the exec before the uvicorn command in the entrypoint.sh files so as to fix the signal handling for gracefully shutting it down.

Finally, this PR also installs the correct version of huggingface_hub before installing the huggingface-inference-toolkit to prevent the latest from being installed, which deprecates the cached_download method used by other dependencies.