Closed Quang-elec44 closed 8 months ago
Indeed, the Docker image doesn't include support for NVIDIA T4 GPUs.
Let me include this architecture in the upcoming maintenance release.
Will be fixed by this PR: https://github.com/huggingface/optimum-nvidia/pull/35
@Quang-elec44 we extended support to your device (and more) in the latest release 0.1.0b2, give it a try 🤗
@mfuntowicz Thank you so much. I'll try it and report back.
Hi, I am currently testing with the TinyLlama/TinyLlama-1.1B-Chat-v0.3 model on an NVIDIA Tesla T4, and the Docker image version is 0.1.0b1. Unfortunately, there is an error when doing inference, and here is the full error log. Can you help me out? Thanks in advance.