-
https://github.com/pytorch/ao/issues/260 libcudart cannot be loaded, but why? We're exporting executorch model
~~~
https://github.com/pytorch/torchchat/actions/runs/9166937828/job/2520327894…
-
Encountered an error while attempting to quantize a model using the ./quantize command. The quantization process failed with the following error message:
```Error:
main: quantizing './models/llama…
-
I'm been using quantization tools like GPTQ, Exllama, or QUIP#. Those tools is quite fast to do quantization in a single A6000 gpu. But, this tool takes a really long time even though I'm using two A6…
-
For 128-slice(or more) ct , cone-beam artifacts are severe,Whether LEAP supports removal of cone-beam artifacts?
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [x] I am running the latest code. Development is very rapid so there are no tagged versions as of…
ryao updated
5 months ago
-
When quantizing, the program crashes while packaging the model.
![image](https://github.com/AutoGPTQ/AutoGPTQ/assets/37856372/428762d2-1812-4bb3-9e7e-25a6c4f5f794)
-
Using b2854
Converted Hermes-2-Theta-Llama-3-8B to F32, then measured imatrix with https://gist.github.com/bartowski1182/b6ac44691e994344625687afe3263b3a
Upon quanting, all sizes work fine, exce…
-
### Is your feature request related to a problem? Please describe
Aggregations are the most used query type in observability use cases and the aggregation is typically on metrics, request logs, etc…
-
# Proposal: gain maps for PNG
**This proposal has no official standing in PNG WG and is presented for discussion only. Do not implement.**
## [3 Terms, definitions, and abbreviated terms](https:…
-
Dear readers,
Thank you for your hard work and for providing such an interesting library.
Actually, I am working on quantization, especially on the YOLOv7 module. I made a small change in the 'C…