Open medwang1 opened 5 months ago
Your current environment
🐛 Describe the bug
How to deploy the model
I use the FP8 model
Yi-1.5-34B-Chat-FP8
generated by the Python script above. I then ran a stress test with a concurrency of 128, which produced the error log below: