-
Run on a Mac M3 Max with 128 GB of RAM.
Run this code:
```
from transformers import AutoModel, AutoTokenizer
MAX_LENGTH = 128
model = AutoModel.from_pretrained("unsloth/Meta-Llama-3.1-405B-Instruct-bnb-4b…
-
https://github.com/PaddlePaddle/PaddleSlim/tree/develop/example/auto_compression/ocr
This approach, after switching to the ICDAR2015 dataset and using a pretrained ResNet50 model (only the model config needs changing), runs successfully: accuracy is essentially unchanged, inference time drops to 1/4, and an Inference model is produced. However, converting this model to ONNX fails with an error about a missing quantization configuration file (cali…
-
Hi, I trained a model using a 16/8 configuration (the configuration JSON I used is attached) and everything was fine during AIMET optimization. When I try to deploy the model to the DSP I use the commands m…
-
Since my training environment cannot connect to the internet, I downloaded the model and dataset and saved them to local disk.
The arguments:
**model path**: ModelArguments(base_model_revision=N…
-
### System Info
```shell
The examples provided do not work correctly. I think there have been updates to the Intel Neural Compressor toolkit, which is now 3.0, and to the Habana quantization toolkit, and…
-
Please add Qwen2 support
```
EETQ_CAUSAL_LM_MODEL_MAP = {
"llama": LlamaEETQForCausalLM,
"baichuan": BaichuanEETQForCausalLM,
"gemma": GemmaEETQForCausalLM
}
```
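A minimal sketch of the requested change, assuming a hypothetical `Qwen2EETQForCausalLM` wrapper class (EETQ would need a real implementation for it; the stub classes below exist only so the registry pattern runs standalone):

```python
# Sketch only: stand-in classes for EETQ's per-architecture wrappers.
class LlamaEETQForCausalLM: pass
class BaichuanEETQForCausalLM: pass
class GemmaEETQForCausalLM: pass
class Qwen2EETQForCausalLM: pass  # hypothetical new wrapper for Qwen2

EETQ_CAUSAL_LM_MODEL_MAP = {
    "llama": LlamaEETQForCausalLM,
    "baichuan": BaichuanEETQForCausalLM,
    "gemma": GemmaEETQForCausalLM,
    "qwen2": Qwen2EETQForCausalLM,  # the entry this request asks for
}

# Dispatch by model type, as the map is presumably used upstream.
cls = EETQ_CAUSAL_LM_MODEL_MAP["qwen2"]
print(cls.__name__)  # → Qwen2EETQForCausalLM
```

Besides the map entry, the actual wrapper class would have to mirror Qwen2's layer layout, so this is only the registration half of the change.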
-
### Your current environment
The output of `python collect_env.py`
```text
C:\Users\bobni\OneDrive\Desktop\Projects\p2pIssue>bash
training@Training:/mnt/c/Users/bobni/OneDrive/Desktop/Projects…
-
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat2 in method wrapper_CUDA_bmm)
```python
from aw…
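
# A general sketch of the fix for this error (not the reporter's code): the
# two operands of torch.bmm must live on the same device, so move mat2 onto
# mat1's device before the matmul. The shapes here are illustrative.
import torch

a = torch.randn(2, 3, 4)  # in the failing run this tensor is on cuda:0
b = torch.randn(2, 4, 5)  # ...while this one was left on cpu (mat2)
b = b.to(a.device)        # align devices; no-op if they already match
out = torch.bmm(a, b)     # now succeeds; result shape is (2, 3, 5)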
-
# Todo
https://github.com/OCA/maintainer-tools/wiki/Migration-to-version-15.0
# Modules to migrate
- [ ] delivery_package_default_shipping_weight
- [x] delivery_procurement_group_carrier - B…
-
The setup is two Linux machines, each with two A100 40 GB GPUs: A100 (40G) * 2
The training command is as follows. The master-node command is CUDA_VISIBLE_DEVICES=0,1 NNODES=2 NODE_RANK=0 NPROC_PER_NODE=2 MASTER_ADDR=127.0.0.1 swift sft --model_type qwen1half-7b-chat --model_id_or_path /…