Closed julien-c closed 1 month ago
(since we'll need to keep the architecture list fully synced at all times - which gets updated quite a bit)
Yes, but IMO in the future we (e.g. @ngxson 😁) can automate the syncing of our architecture list with upstream through some CI
companion internal PR
TL;DR: we want the ability to discriminate non-llama.cpp gguf repos
I've picked
${architecture}.context_length
as a property that, if it's in the gguf file, we assume it's compatible with llama.cppAnother option was to check whether
architecture
is inside the list in@huggingface/gguf
(and keep this list up to date with upstream, maybe through some CI)Both options are possible, so let me know what you think is best