Open alexanderdicke-webcom opened 2 weeks ago
Hey @alexanderdicke-webcom! Are you using the docker image with version 2.0.4
or have you built it locally to that version? I haven't seen this error before and don't have a 8xH100 handy; do you get the same issue on 8xA100?
Hey @LysandreJik! We are using the official docker image. I will see if I can try it out on 8xA100.
System Info
TGI Version: v2.0.4 Model:
mistralai/Mixtral-8x22B-Instruct-v0.1
Hardware: 8x Nvidia H100 70GB HBM3 Deployment specificities: OpenShiftInformation
Tasks
Reproduction
Running TGI with
MAX_BATCH_PREFILL_TOKENS=35000
MAX_INPUT_LENGTH=35000
MAX_TOTAL_TOKENS=36864
NUM_SHARD=8
results in the following error:
Expected behavior
The warmup is successful.