huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
https://huggingface.co/transformers
Apache License 2.0
132.24k stars 26.34k forks source link

Support loading shard GGUF models #32266

Open Isotr0py opened 1 month ago

Isotr0py commented 1 month ago

Feature request

Motivation

Your contribution

LysandreJik commented 1 month ago

Ah very cool! Thanks foir sharing @Isotr0py, it would be nice to indeed have support for this sharding.

If you'd like, we're very open to PRs :)

cc @SunMarc

julien-c commented 1 month ago

maybe cc @phymbert for visibility too

julien-c commented 1 month ago

and linking the related https://github.com/ggerganov/llama.cpp/issues/9023