Open pseudotensor opened 5 months ago
@pseudotensor
I made a fix of the template issue at sglang. You could check the new examples folder with updated code.
https://github.com/sgl-project/sglang/blob/main/examples/usage/llava/
Thanks, trying latest sglang, latest llava-next, and what I think your change to the script was -- i.e. assistant None as last appended thing.
Yes, that maybe fixed the problem. Any idea about the TypeError or size error?
https://github.com/sgl-project/sglang/issues/474 https://github.com/sgl-project/sglang/issues/473
server:
client:
just gives:
No matter how I prompt, the output is extremely terse even if accurate.
I changed the image, but otherwise this is the default script from sglang: https://github.com/sgl-project/sglang/blob/main/examples/usage/llava/http_qwen_llava_test.py
If I try increasing temperature to 0.5, I get no response at all and it just fails:
I don't understand what it means by the TypeError just because of temperature=0.5, very odd. Is that sglang's fault?