Closed amihalik closed 3 hours ago
An earlier version of TGI sha-cbced7f
does not appear to have the same issue.
Thanks for the report @amihalik, I confirm I get the same error after two requests
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
System Info
g6.12xlarge, docker container: ghcr.io/huggingface/text-generation-inference:sha-90184df
Information
Tasks
Reproduction
Start up Phi-3 on TGI on a g6.12xlarge
This simple query will return just fine
This simple query will crash the TGI server (note you might have to run it a couple times):
Note: I tried the same test using
Qwen/Qwen2-7B-Instruct
and TGI didn't die.In the trace below, I ran Step 2 a few times and then Step 3 once to crash TGI:
Expected behavior
TGI returns a generated_text.