lyogavin / airllm

AirLLM 70B inference with single 4GB GPU
Apache License 2.0
5.28k stars 423 forks source link

Fixing mlx model load #174

Closed Razikus closed 2 months ago

Razikus commented 3 months ago

fixes https://github.com/lyogavin/airllm/issues/116