quantizing Search Results

1000+ results
for quantizing

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/torchchat #843

[LAUNCH BLOCKER?] https://github.com/pytorch/ao/issues/260 l…

https://github.com/pytorch/ao/issues/260 libcudart cannot be loaded, but why? We're exporting executorch model ~~~ https://github.com/pytorch/torchchat/actions/runs/9166937828/job/2520327894…

mikekgfb updated 4 months ago
1
meta-llama/llama3 #130

Can't quantize the model using LLama.cpp

Encountered an error while attempting to quantize a model using the ./quantize command. The quantization process failed with the following error message: ```Error: main: quantizing './models/llama…

Codedestructor56 updated 4 months ago
2
Vahe1994/AQLM #32

How long does it take to quantize?

I'm been using quantization tools like GPTQ, Exllama, or QUIP#. Those tools is quite fast to do quantization in a single A6000 gpu. But, this tool takes a really long time even though I'm using two A6…

fahadh4ilyas updated 5 months ago
3
LLNL/LEAP #38

remove cone-beam artifacts

For 128-slice(or more) ct , cone-beam artifacts are severe,Whether LEAP supports removal of cone-beam artifacts?

scf819 updated 3 months ago
8
ggerganov/llama.cpp #6830

bf16 support

# Prerequisites Please answer the following questions for yourself before submitting an issue. - [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…

ryao updated 5 months ago
2
AutoGPTQ/AutoGPTQ #571

[BUG]Killed while packing model

When quantizing, the program crashes while packaging the model. ![image](https://github.com/AutoGPTQ/AutoGPTQ/assets/37856372/428762d2-1812-4bb3-9e7e-25a6c4f5f794)

MontaEllis updated 6 months ago
1
ggerganov/llama.cpp #7311

ggml_validate_row_data finding nan value for IQ4_NL

Using b2854 Converted Hermes-2-Theta-Llama-3-8B to F32, then measured imatrix with https://gist.github.com/bartowski1182/b6ac44691e994344625687afe3263b3a Upon quanting, all sizes work fine, exce…

bartowski1182 updated 4 months ago
7
opensearch-project/OpenSearch #12498

[RFC] Pre Compute Aggregations with Star Tree index

### Is your feature request related to a problem? Please describe Aggregations are the most used query type in observability use cases and the aggregation is typically on metrics, request logs, etc…

bharath-techie updated 1 month ago
37
w3c/png #380

Proposal: gain maps for PNG

# Proposal: gain maps for PNG **This proposal has no official standing in PNG WG and is presented for discussion only. Do not implement.** ## [3 Terms, definitions, and abbreviated terms](https:…

svgeesus updated 2 weeks ago
41
Xilinx/brevitas #627

Quantizing Yolov7

Dear readers, Thank you for your hard work and for providing such an interesting library. Actually, I am working on quantization, especially on the YOLOv7 module. I made a small change in the 'C…

IsmailAM1999 updated 7 months ago
10

上一页 1...90 91 92 93 94 95 96...100 下一页

1000+ results for quantizing

1000+ results
for quantizing