Open Isotr0py opened 1 month ago
Ah, very cool! Thanks for sharing @Isotr0py, it would indeed be nice to have support for this sharding.
If you'd like, we're very open to PRs :)
cc @SunMarc
maybe cc @phymbert for visibility too
and linking the related https://github.com/ggerganov/llama.cpp/issues/9023
Feature request
Motivation
transformers only supports single-file GGUF models currently.
Your contribution
<ShardNum>-of-<ShardTotal>, just like sharded safetensors.
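To make the naming scheme concrete, here is a minimal sketch of how sibling shard names could be derived from one shard's file name. It assumes the shard suffix is zero-padded to five digits and sits directly before the .gguf extension (for example model-00001-of-00003.gguf); the helper name expected_shards and the regular expression are hypothetical and are not part of transformers or llama.cpp.

```python
import re

# Assumption: shards are named <prefix>-<ShardNum>-of-<ShardTotal>.gguf,
# with both numbers zero-padded to five digits.
SHARD_RE = re.compile(r"^(?P<prefix>.+)-(?P<num>\d{5})-of-(?P<total>\d{5})\.gguf$")


def expected_shards(filename: str) -> list[str]:
    """Return every shard file name implied by `filename`.

    Hypothetical helper: a name without a shard suffix is treated as a
    single-file model and returned unchanged in a one-element list.
    """
    match = SHARD_RE.match(filename)
    if match is None:
        return [filename]
    prefix = match.group("prefix")
    total = int(match.group("total"))
    # Shard numbering is assumed to start at 1 and run through `total`.
    return [f"{prefix}-{i:05d}-of-{total:05d}.gguf" for i in range(1, total + 1)]
```

A loader could call expected_shards on whichever file name the user passes, check that each listed file exists, and then merge the tensors from all shards.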