Open cszhz opened 2 months ago
@cszhz thank you for your feedback.
According to your traces, the server started normally. What do you mean when you say it hangs ?
What do you get when you query its URL using CURL or the huggingface_hub
inference client ?
Hi @dacorvo I don't think server started normally, In the previous 0.0.21 image, it worked fine. Here is the response from docker container 763104351884.dkr.ecr.us-west-2.amazonaws.com/huggingface-pytorch-tgi-inference:2.1.2-optimum0.0.24-neuronx-py310-ubuntu22.04
curl 127.0.0.1:8080/generate \
-X POST \
-d '{
"inputs":"What is Deep Learning?",
"parameters":{
"max_new_tokens":20
}
}' \
-H 'Content-Type: application/json'
curl: (56) Recv failure: Connection reset by peer
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.
System Info
Who can help?
Inference @dacorvo, @JingyaHuang TGI @dacorvo
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction (minimal, reproducible, runnable)
I confirm
optimum-neuron
version: 0.0.21 with Neuron 2.18.2 is working fine.After about 1 minutes, the server hangs
Expected behavior
The TGI server can be started normally.