Open EgyipTomi425 opened 6 days ago
Hi. I use Llama 3, and I'd like to stream the output. I mean, it should be somehow with the 8001 port API. I'd like generate few tokens and send it to client time by time. Is it possible? It could help me a lot. Have a good day.
Hi. I use Llama 3, and I'd like to stream the output. I mean, it should be somehow with the 8001 port API. I'd like generate few tokens and send it to client time by time. Is it possible? It could help me a lot. Have a good day.