turboderp / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs
MIT License
3.45k stars, 257 forks

fix infer websocket_actions.py #182

Closed: Kerushii closed 9 months ago