Closed dsikka closed 9 months ago
I ran through the script using hf:neuralmagic/TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds
as the model and installing pip install fschat accelerate
Looks like something about the last message handshake went wrong.
ChatCompletionChunk(id='cmpl-c735b32f15c043b49893cd6a0ac7ab96', choices=[Choice(delta=ChoiceDelta(content='', function_call=None, role=None, tool_calls=None), finish_reason='length', index=0)], created=1701898636, model='hf:neuralmagic/TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds', object='chat.completion.chunk', system_fingerprint=None)
httpx.RemoteProtocolError: peer closed connection without sending complete message body (incomplete chunked read)
File "/Users/mgoin/code/deepsparse/src/deepsparse/server/openai_server.py", line 159, in abort_request
await pipeline.abort(request_id)
AttributeError: 'TextGenerationPipeline' object has no attribute 'abort'
I ran through the script using
hf:neuralmagic/TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds
as the model and installingpip install fschat accelerate
Looks like something about the last message handshake went wrong.
ChatCompletionChunk(id='cmpl-c735b32f15c043b49893cd6a0ac7ab96', choices=[Choice(delta=ChoiceDelta(content='', function_call=None, role=None, tool_calls=None), finish_reason='length', index=0)], created=1701898636, model='hf:neuralmagic/TinyLlama-1.1B-Chat-v0.4-pruned50-quant-ds', object='chat.completion.chunk', system_fingerprint=None) httpx.RemoteProtocolError: peer closed connection without sending complete message body (incomplete chunked read)
File "/Users/mgoin/code/deepsparse/src/deepsparse/server/openai_server.py", line 159, in abort_request await pipeline.abort(request_id) AttributeError: 'TextGenerationPipeline' object has no attribute 'abort'
What script did you use? The example script in the PR description? That seems to work for me. If you send me your code/example, I can investigate.
Summary
/v1/completions
endpoint/v1/chat/completions
to accept/handle FastChat-compliant dictionariesTesting