The example now follows the same formatting as the rest of the Vertex AI example Jupyter Notebooks within this repository, keeping the section for the quota increase request, adding some disclaimer messages when needed (deployment time, gated access, or MESSAGES_API_ENABLED upcoming support).
Description
This PR ports the example on how to deploy Meta Llama 3.1 405B Instruct FP8 developed for this blog post, and previously hosted in
alvarobartt/meta-llama-3-1-on-vertex-ai
.The example now follows the same formatting as the rest of the Vertex AI example Jupyter Notebooks within this repository, keeping the section for the quota increase request, adding some disclaimer messages when needed (deployment time, gated access, or
MESSAGES_API_ENABLED
upcoming support).