Open crzwork2024 opened 2 days ago
Did you use the GPU or the CPU? How many tokens per second does the server log show?
Hi, I was using the GPU. Where can I check the number of tokens per second? I just followed the GitHub instructions and changed nothing in your original code. Are there any parameters I should adjust to make it run smoothly? Thanks!
I can add that the input was always interrupted at around 2 seconds; I'm not sure why.
Great project.
The installation process on Windows went smoothly for me, but during conversations the real-time dialogue is intermittent, while the manually generated dialogue is very smooth. Does anyone know the reason?
Thanks!