PygmalionAI / aphrodite-engine

PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
606 stars 78 forks source link

Support sharded ggufs. #420

Closed sgsdxzy closed 3 weeks ago

sgsdxzy commented 4 weeks ago

Support sharded .gguf files in a directory. Also the user can supply the config.json and tokenizer in the directory.