yinochaos closed this issue 8 months ago
How about setting tensor_parallel_size=2?
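For context on why this setting might matter for 70b but not 13b, here is a rough back-of-the-envelope sketch. It assumes the failure is memory-related, that the weights are loaded in fp16 (2 bytes per parameter), and that each GPU has 80 GiB; none of this is confirmed in the thread. The helper names `weight_memory_gib` and `min_tensor_parallel_size` are hypothetical, and the estimate covers weights only, ignoring KV cache and activations.

```python
def weight_memory_gib(num_params_billion: float, bytes_per_param: int = 2) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Assumption: fp16/bf16 weights, i.e. 2 bytes per parameter by default.
    """
    return num_params_billion * 1e9 * bytes_per_param / 2**30


def min_tensor_parallel_size(
    num_params_billion: float,
    gpu_memory_gib: float,
    bytes_per_param: int = 2,
) -> int:
    """Smallest power-of-two GPU count whose combined memory holds the weights.

    Powers of two are a common choice here, not a hard rule; real deployments
    also need headroom for KV cache and activations.
    """
    needed = weight_memory_gib(num_params_billion, bytes_per_param)
    size = 1
    while size * gpu_memory_gib < needed:
        size *= 2
    return size


# Hypothetical hardware: 80 GiB per GPU.
print(f"70b weights: {weight_memory_gib(70):.1f} GiB")
print(f"13b weights: {weight_memory_gib(13):.1f} GiB")
print("70b needs tensor_parallel_size >=", min_tensor_parallel_size(70, 80))
print("13b needs tensor_parallel_size >=", min_tensor_parallel_size(13, 80))
```

Under these assumptions the 13b weights fit on a single GPU while the 70b weights do not, which would be consistent with the behavior reported in this issue. If llama_infer.py uses vLLM (the thread does not show its contents), the value is passed to the constructor, e.g. `LLM(model=..., tensor_parallel_size=2)`.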
Closing this issue as stale as there has been no discussion in the past 3 months.
If you are still experiencing the issue you describe, feel free to re-open this issue.
code
llama_infer.py
python llama_infer.py wrong infos
But when I change 70b to 13b, it works without any error output.