Closed sarda-nikhil closed 8 months ago
Currently, the streaming output skips the last few tokens, leaving the output looking truncated.
I'll take a look! That might be because of the max_new_tokens.
max_new_tokens
Currently, the streaming output skips the last few tokens, leaving the output looking truncated.