This PR updates the nim_llama3.ipynb jupyter notebook to use 'InferenceAmiVersion': 'al2-ami-sagemaker-inference-gpu-2' within ProductionVariants inside EndpointConfig. This will make sure that newer driver is used on g5 and p4d/p4de instances instead of default 470 one and solve pynvml driver issues. It should fix this issue https://github.com/NVIDIA/nim-deploy/issues/98
This PR updates the nim_llama3.ipynb jupyter notebook to use 'InferenceAmiVersion': 'al2-ami-sagemaker-inference-gpu-2' within ProductionVariants inside EndpointConfig. This will make sure that newer driver is used on g5 and p4d/p4de instances instead of default 470 one and solve pynvml driver issues. It should fix this issue https://github.com/NVIDIA/nim-deploy/issues/98
cc: @JamesMaki @abhisheksawarkar