-
Due to a current default in the boost library (https://github.com/boostorg/math/issues/1211) in boost::math::digamma, there is a performance hit on aarch64.
This happens on v1.10.3 of Salmon, with …
-
Hi,
I am trying to run this pipeline on computing cluster but it fails due to lack of resources. Wondering where to set the lower resource requirements. I gave 64 cpus and 96 RAM, even then it fai…
-
Thanks for the excellent work!
I use examples/basic_quant_mix.py to quantize the Qwen2-7B model with --w_bit 8. It's very strange that the quantized model is even larger than the original model.
…
-
I tested two quantization methods on a 3B model: w8a8 smooth quant and int8 weight-only quant.
The following is the efficiency of different optimization methods .I'm a little confused, Is int8 wei…
-
I tried to quantize the model into BF16 and FP16 to preserve a bit more precision than NF4/FP8 while still running fast on data center cards. However, when I tried to hack the way to get it to run i…
-
# Quantified with the Yolov5 model, the MAP@0.5 is high(around 0.47), but the detection results are outrageous and unexpected
These days I have tried to do some quantification with yolov5_nano by …
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Expo…
ssdv1 updated
17 hours ago
-
Running the tutorial "run_vit_b_quant.py" and report following errors in 'benchmark_model(model,20,inputs)':
torch._dynamo.exc.InternalTorchDynamoError: 'PlainAQTLayout' object has no attribute 'layo…
-
Hi Michael
A collaborator has to analyze 3000+ files ( dda orbitrap so around 700MB - 1.5GB per file )
FragPipe ran out of memory with 1TB on a VM during quant....
I was wondering wherher SAGE mi…
-
The paper https://arxiv.org/abs/quant-ph/0406176 introduces an algorithm for performing quantum shannon decomposition. We have this algorithm implemented in Cirq in https://github.com/quantumlib/Cirq/…