-
First, I saved the fine-tuned LoRA model as merged_16bit to my Hugging Face repo, and I have adapter_config.json and adapter_model.safetensors inside the repo. Now, when trying to load it with AutoAWQForCausalL…
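For context, the usual AutoAWQ flow for quantizing a fully merged checkpoint looks roughly like the sketch below. Repo ids are placeholders, and it assumes the repo actually contains the merged weights (config.json plus the full model safetensors) rather than only the adapter files:

```python
# Sketch only, with placeholder repo ids. Assumes the Hugging Face repo holds
# the merged 16-bit weights, not just adapter_config.json / adapter_model.safetensors.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "your-username/your-merged-16bit-model"  # placeholder repo id
quant_path = "your-merged-model-awq"                  # local output dir

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}
model.quantize(tokenizer, quant_config=quant_config)

model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```

If the repo only holds the adapter files, there are no base weights for AutoAWQ to quantize, so pushing the fully merged model first would be a prerequisite for this flow.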
-
I am running torchao 0.5 and torch '2.5.0a0+b465a5843b.nv24.09' on an NVIDIA A6000 Ada card (sm89), which supports FP8.
I ran the generate.py code from the benchmark:
python generate.py --c…
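For reference, FP8 weight-only quantization can also be applied outside the benchmark script, roughly as in the sketch below. It assumes torchao 0.5 exposes float8_weight_only through quantize_, and the model id is only a placeholder, not the one from the benchmark run:

```python
# Sketch: FP8 (e4m3) weight-only quantization via torchao's quantize_ API.
# Assumes torchao >= 0.5 provides float8_weight_only; needs sm89+ hardware for FP8.
import torch
from torchao.quantization import quantize_, float8_weight_only
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-hf"  # placeholder model, not from the original report
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="cuda"
)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Swap nn.Linear weights for FP8 weight-only quantized versions in place.
quantize_(model, float8_weight_only())

inputs = tokenizer("Hello, my name is", return_tensors="pt").to("cuda")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```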
-
### System Info
```
pip install git+https://github.com/huggingface/transformers.git
pip install tokenizers==0.20.0
pip install accelerate==0.34.2
pip install git+https://github.com/huggingface/tr…
-
### The model to consider.
https://huggingface.co/Tele-AI/TeleChat-12B
### The closest model vllm already supports.
qwen2
### What's your difficulty of supporting the model you want?
I …
-
Hiya,
Comfy-Org put out an FP8 scaled version of Mochi. I'm curious what kind of quality can be gotten out of it, but it doesn't seem compatible with this repo.
https://huggingface.co/Comfy-…
-
**Bug description**
I am using ***MetaGPT ver 0.8.1***, but when I use RAG with the **SimpleEngine.from_docs** method I get the error ***ValueError: Creator not registered for key: LLMType.OLLAMA***.
**…
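A minimal repro sketch, assuming MetaGPT 0.8.1's RAG engine import path and using a placeholder document path:

```python
# Sketch of the failing call path; the document path is a placeholder.
# With an Ollama LLM configured in config2.yaml, from_docs raises
# "ValueError: Creator not registered for key: LLMType.OLLAMA".
import asyncio

from metagpt.rag.engines import SimpleEngine


async def main():
    engine = SimpleEngine.from_docs(input_files=["docs/example.txt"])  # placeholder file
    answer = await engine.aquery("What does the document say?")
    print(answer)


asyncio.run(main())
```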
-
I encountered an issue while trying to quantize the YOLOv8s model using the Ryzen AI quantizer. Below are the details of the error:
### Error Message:
```
No CUDA runtime is found, using CUDA_HOM…
-
### 🐛 Describe the bug
The following script attempts to fuse two custom operations into a single custom op. One of the original ops, as well as the fused op, has multiple outputs. The resultin…
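Since the original script is truncated here, the sketch below only illustrates the multi-output custom op part (using the torch.library define/impl/register_fake API, which assumes torch >= 2.4); it does not reproduce the fusion pattern itself:

```python
# Sketch only, not the original repro: a custom op with two tensor outputs.
import torch

# Schema declares two tensor outputs.
torch.library.define("mylib::split_scale", "(Tensor x) -> (Tensor, Tensor)")

@torch.library.impl("mylib::split_scale", "cpu")
def _split_scale_cpu(x):
    # Real CPU implementation: two differently scaled copies of the input.
    return x * 2.0, x * 3.0

@torch.library.register_fake("mylib::split_scale")
def _split_scale_fake(x):
    # Fake (meta) implementation so the op can be traced under torch.compile.
    return torch.empty_like(x), torch.empty_like(x)

def f(x):
    a, b = torch.ops.mylib.split_scale(x)
    return a + b

# Exercise the op under torch.compile to confirm the multi-output schema traces.
print(torch.compile(f)(torch.randn(4)))
```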
-
Median_abs_epsilon is calculated here:
https://github.com/Proteobench/ProteoBench/blob/8ed5b5ad9588b5b8b10c3b0cfd9ec284be43a59a/proteobench/datapoint/quant_datapoint.py#L124-L127
So is actually…
-
### Description of feature
I am using the alevin-fry quantitation method. For downstream analysis, I am mainly interested in the final count matrix, e.g.
1. The content of the `af_quant/alevin di…