Closed Christopheraburns closed 2 months ago
Sorry I seem to have not noticed this. It's a system memory issue, same as #504, and the culprit is memory-mapping in safetensors. I'm trying to see if I can rely less on that library when converting models. Track in the other issue.
Discussed in https://github.com/turboderp/exllamav2/discussions/178