-
在convert_hf_to_gguf.py文件中,转换MiniCPM模型的时候,如下类override了modify_tensors,并且只转换了q_proj.weight和k_proj.weight,请问为什么需要转换呢?或者如注释所说“HF models permute some of the tensors, so we need to undo that”,HF model是在那里做了这…
-
### Feature request
Recently, we have added the ability to load `gguf` files within [transformers](https://huggingface.co/docs/hub/en/gguf).
The goal was to offer the possibility to users …
-
Hi,
I'm experiencing an error when using the ComfyUI-GGUF custom node with ComfyUI. The error occurs during image generation and seems related to the cast_to function receiving an unexpected keywor…
-
Hi @minuszoneAI
Please upload GGUF model to hugging face. The link from china server is very slow to download.
Thank you
-
### Cortex version
`cortex run` redownloads existing model multiple times
### Describe the Bug
2 issues (see screenshot)
- tinyllama:gguf is already downloaded
- `cortex run tinyllama:gguf` is su…
-
hi, since native comfyui already supported t5 GUFF text encoder to speed up loading Flux model [link](https://www.reddit.com/r/StableDiffusion/comments/1ewpwtp/flux1_t5_v11xxl_gguf_clip_encode_compare…
-
**Please describe the feature you want**
Tabby will now download gguf model by the URL specified in the model registry,
but it only supports one URL per model, the vec is used for selecting one UR…
-
Is there any plan to support GGUF format directly apart from SafeTensor, that will allow to use this to load other GGUF's. If support already exists can we add it to readme file.
-
Hi, can you provide mmproj + gguf files for use in llama.cpp?
-
问题:
1. 可以通过查询参数准确过滤出 GGUF 的模型吗?
当前状况:
1. 搜索模型时可以通过空格加上 gguf 字符串,比如:“qwen1.5 gguf”,这样搜索的结果大多数都是 gguf 的模型,但是还会返回一些不是 gguf 的模型。
2. 返回的模型 Tags 中也没有相应的标识。
期望:
1. 类似 Hugging Face 一样可以通过参数…