Closed alexanderdicke-webcom closed 4 months ago
Hey @alexanderdicke-webcom! Are you using the docker image with version 2.0.4
or have you built it locally to that version? I haven't seen this error before and don't have a 8xH100 handy; do you get the same issue on 8xA100?
Hey @LysandreJik! We are using the official docker image. I will see if I can try it out on 8xA100.
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
System Info
TGI Version: v2.0.4 Model:
mistralai/Mixtral-8x22B-Instruct-v0.1
Hardware: 8x Nvidia H100 70GB HBM3 Deployment specificities: OpenShiftInformation
Tasks
Reproduction
Running TGI with
MAX_BATCH_PREFILL_TOKENS=35000
MAX_INPUT_LENGTH=35000
MAX_TOTAL_TOKENS=36864
NUM_SHARD=8
results in the following error:
Expected behavior
The warmup is successful.