This PR reviews and improves some of the existing Vertex AI examples in this repository: the formatting is aligned with the recently added examples, content is added or expanded where needed, missing pieces are filled in, and the examples are reproduced with the latest containers.
Here's a simple checklist of what is pending review; most of the content is already there, so the remaining work should be light:
[x] TEI in Vertex AI (from Hub and from GCS)
[x] TGI in Vertex AI (from Hub and from GCS)
[x] PyTorch Inference in Vertex AI (relies on huggingface-inference-toolkit wheel)
The rest of the examples related to the PyTorch Training container are tackled in #44
Notes
gemma-finetuning-clm-lora-sft.ipynb has been removed since it was not directly related to Vertex AI, and is already covered in #44
deploy-mistral-on-vertex-ai.ipynb has been removed in favour of deploy-gemma-on-vertex-ai.ipynb, since the only difference between the two is the MODEL_ID provided to TGI, so keeping both adds no value at this stage
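To illustrate the point about MODEL_ID, here is a minimal sketch of how the two notebooks' TGI container environments would compare. The helper name `tgi_env`, the `NUM_SHARD` value, and both model IDs are assumptions chosen for illustration and are not taken from the notebooks themselves.

```python
def tgi_env(model_id: str) -> dict[str, str]:
    """Return an illustrative subset of environment variables for a TGI container.

    Hypothetical helper: only MODEL_ID depends on the model being deployed.
    """
    return {
        "MODEL_ID": model_id,  # Hub model ID (or path) that TGI loads
        "NUM_SHARD": "1",      # assumed value, identical for both deployments
    }


# Assumed example model IDs; the notebooks may use different ones.
gemma_env = tgi_env("google/gemma-7b-it")
mistral_env = tgi_env("mistralai/Mistral-7B-Instruct-v0.2")

# Collect the keys whose values differ between the two configurations.
differing_keys = sorted(k for k in gemma_env if gemma_env[k] != mistral_env[k])
print(differing_keys)
```

Under these assumptions the only differing key is MODEL_ID, which is why a single notebook is enough to cover both models.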