post-training-quantization Search Results

1000+ results
for post-training-quantization

huggingface/transformers #22546

RuntimeError: CUDA error: device-side assert triggered when …

### System Info - `transformers` version: 4.28.0.dev0 - Platform: Linux-3.10.0-1160.81.1.el7.x86_64-x86_64-with-glibc2.17 - Python version: 3.11.2 - Huggingface_hub version: 0.13.3 - Safetensor…

TerryCM updated 4 days ago
42
tensorflow/tflite-micro #624

Force symmetric filter weights

From `fully_connected_common.cc` I see that filter weights must be symmetric, i.e. `zero_point=0`. How can I achieve this? Also, is it only possible by using quantization-aware training, or can it …

tbec updated 1 year ago
2
Deci-AI/super-gradients #1515

RuntimeError: Exporting the operator fake_quantize_per_tenso…

### 🐛 Describe the bug When I try to fill a quantization, my code causes an error: RuntimeError: Exporting the operator fake_quantize_per_tensor_affine to ONNX opset version 9 is not supported. Supp…

Bananaspirit updated 9 months ago
1
openvinotoolkit/nncf #1936

NNCF2.5 When quantizing the model, an error occurred: "Runti…

**I have an ONNX model that contains convolutional layers but no fully connected layers. Upon inspection with Netron, I found that if a convolutional layer is not directly followed by a BatchNormaliza…

edition3234 updated 11 months ago
38
OpenLMLab/LOMO #12

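Are quantized models supported?

Hello, are quantized models supported, e.g. GPTQ? If so, scaling proportionally, with eight 24 GB GPUs and pipeline parallelism, would I be able to LoRA fine-tune a quantized 175B model? Thanks~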

laoda513 updated 10 months ago
4
onnx/steering-committee #52

Neural compressor Proposal - to add port the repo under Inte…

### ONNX Model Compressor ### Quantization Tool Proposal Intel Neural Compressor(INC) is a tool for generating optimized ONNX models and supports techniques like Post training quantization (P…

liqunfu updated 10 months ago
16
bitsandbytes-foundation/bitsandbytes #416

AttributeError: module 'bitsandbytes.nn' has no attribute 'L…

===================================BUG REPORT=================================== Welcome to bitsandbytes. For bug reports, please run python -m bitsandbytes and submit this information toget…

hennypurwadi updated 8 months ago
16
Xilinx/Vitis-AI #1093

elew0's quantization info's error

Dear, We have a yolov3 tiny model that can run on the DPU. Quantization is ok but when compiling we get the following error: [UNILOG][FATAL][XCOM_UNSUPPORT_QUANTIZATION][The fix info is error o…

damor-rbz updated 1 year ago
2
huggingface/transformers #23904

save_pretrained 4-bit models with bitsandbytes

With the latest version of bitsandbytes (0.39.0) library, isn't it possible to serialize 4-bit models then? Thus this section should be updated to allow the user to save these models. https://gith…

westn updated 9 months ago
11
oobabooga/text-generation-webui #2396

Implement NBCE, a recent trick to extend any LLM's context l…

**Description** Naive Bayes-based Context Extension is a method that uses the idea of naive Bayes to extend the context handling length of large language models as long as there is enough computing…

kabachuha updated 9 months ago
9
