-
First of all, thanks for developing this excellent library!
My strategy enters a short/long position right after closing the long/short position, whenever the opposite signal occurs. And each position wil…
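For what it's worth, this "stop-and-reverse" behavior can be sketched in plain Python; the `on_signal` function and signal strings below are hypothetical placeholders, not part of any specific backtesting library's API.

```python
# Hypothetical stop-and-reverse sketch: +1 = long, -1 = short, 0 = flat.

def on_signal(position, signal):
    """Return the new position after a signal: close the current side
    and immediately open the opposite one (stop-and-reverse)."""
    if signal == "long" and position <= 0:
        return 1    # close any short, then go long
    if signal == "short" and position >= 0:
        return -1   # close any long, then go short
    return position  # signal agrees with current side: hold

position = 0
for signal in ["long", "long", "short", "long"]:
    position = on_signal(position, signal)

print(position)  # → 1 (long after the final signal)
```

In a real framework the close and open would be two broker calls in the same bar; the sketch only captures the state transition.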
-
Question 1:
I trained on my own dataset with PaddleSeg-release-2.8.1, then ran quantization-aware training under PaddleSeg-release-2.8.1/deploy/slim/quant and converted the model from dynamic to static graph. I then used paddle2onnx on the static-graph files, but no ONNX file was generated.
(PaddleSeg) D:\PY\PaddleSeg-rele…
-
I used auto_gptq 0.7.1 and ran this code:
python quant_with_alpaca.py --pretrained_model_dir Qwen1.5-14B-Chat --quantized_model_dir Qwen1.5-14B-Chat_4bit --use_triton --save_and_reload --trust_remote…
-
Following the README.md in paddleslim/example/auto_compression, I got the following error when running auto compression:
```
Traceback (most recent call last):
  File "/aidata/CYHan/auto_compass.py", line 42, in <module>
    ac.compress()
  File "/root/ana…
-
**Describe the bug**
```
2023-05-31 11:33:20 INFO [auto_gptq.modeling._base] Quantizing mlp.dense_4h_to_h in layer 7/70...
Traceback (most recent call last):
  File "quant_with_alpaca.py", line 17…
-
Hi, how do I improve the inference time of my Llama2 7B model?
I also used BitsAndBytesConfig, but it does not seem to speed up inference!
code:
`name = "meta-llama/Llama-2-7b-cha…
-
v 0.6.1
```bash
python quantize.py --model_dir ./hg_weight_3999/ --dtype float16 --qformat int4_awq --export_path ./quantized_int4-awq --calib_size 32
```
```log
Using pad_token, but it is not se…
-
Hi, can you share best practices for quantizing CNN models?
Is ModelOpt PTQ quantization the way to go with TensorRT for CNN models (ResNet, RetinaNet, etc.)? I was able to quantize retinanet…
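As a general illustration (not ModelOpt- or TensorRT-specific), post-training static quantization for a CNN follows the same prepare → calibrate → convert flow everywhere; PyTorch's eager-mode API makes the steps explicit:

```python
import torch
import torch.ao.quantization as tq

class TinyCNN(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()      # marks where float -> int8 happens
        self.conv = torch.nn.Conv2d(3, 8, 3)
        self.relu = torch.nn.ReLU()
        self.dequant = tq.DeQuantStub()  # marks where int8 -> float happens

    def forward(self, x):
        return self.dequant(self.relu(self.conv(self.quant(x))))

m = TinyCNN().eval()
m.qconfig = tq.get_default_qconfig("fbgemm")   # x86 CPU backend
tq.prepare(m, inplace=True)                    # insert observers
for _ in range(8):                             # calibrate with representative data
    m(torch.randn(1, 3, 16, 16))
tq.convert(m, inplace=True)                    # replace modules with int8 versions
out = m(torch.randn(1, 3, 16, 16))
```

The TensorRT path is analogous: the calibration data you feed during PTQ is what determines the activation ranges, so use inputs representative of deployment.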
-
I ran mem_spd_test.py and got the following error:
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
I did not make any changes except …
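That error usually means the model (or one of its shards) and the input tensors live on different GPUs. Without seeing the script's internals, a generic fix is to pin both the model and every input to a single device:

```python
import torch

# Pick one device and move both the model and its inputs there.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

model = torch.nn.Linear(4, 2).to(device)
x = torch.randn(1, 4).to(device)  # input on the same device as the model

out = model(x)
```

If the script uses `device_map="auto"` or multi-GPU sharding, the same principle applies: inputs must go to the device of the first layer, e.g. `x.to(model.device)` where the framework exposes it.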
-
In [README.md](https://github.com/olxgroup-oss/libvips-rust-bindings/blob/v1.7.0/README.md), libvips is noted as 8.14.5; however, the test example failed when executed, with the following error.
…