-
```
# Quantize the model
model_prepared = tq.prepare(model_fused)
model_quantized = tq.convert(model_prepared)
# Define the quantization configuration
quant_config = tq.get_default_qconfig('fbge…
```
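For reference, a minimal sketch of the standard eager-mode post-training static quantization flow this snippet appears to follow, assuming `tq` is `torch.ao.quantization`, `model_fused` is an already-fused eval-mode model containing QuantStub/DeQuantStub, and `calibration_loader` is a representative data loader (`calibration_loader` is an assumption, not from the snippet); note that the qconfig is attached to the model before `prepare` is called:
```
import torch
import torch.ao.quantization as tq  # older releases expose this as torch.quantization

model_fused.eval()

# Attach the quantization configuration before preparing the model
# ("fbgemm" targets x86 servers; "qnnpack" targets ARM)
model_fused.qconfig = tq.get_default_qconfig("fbgemm")

# Insert observers that record activation statistics
model_prepared = tq.prepare(model_fused)

# Calibrate with a few representative batches (calibration_loader is assumed)
with torch.no_grad():
    for images, _ in calibration_loader:
        model_prepared(images)

# Replace observed modules with quantized implementations
model_quantized = tq.convert(model_prepared)
```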
-
Do you have examples for working with a quantized llama3?
I'm trying with
```
from transformers import BitsAndBytesConfig
quantization_config = BitsAndBytesConfig(
load_in_8bit=True,
b…
```
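Not from the original post, but a minimal self-contained sketch of loading a Llama 3 checkpoint in 8-bit with bitsandbytes through transformers; the model ID, prompt, and generation settings below are assumptions:
```
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # hypothetical example checkpoint
quantization_config = BitsAndBytesConfig(load_in_8bit=True)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quantization_config,
    device_map="auto",  # requires the accelerate package
)

inputs = tokenizer("Hello", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```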
-
As far as I can see, the quantization method itself is not provided in this project.
All the examples shown here cover how to run inference with VPTQ models, rather than tutorials on how to quantize a model.
Or I might…
-
### Your current environment
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Debia…
-
Hello, I encountered some problems when loading the Llama3.2-90B-Vision-Instruct model with FP8. Can you help me take a look?
Version of llama_stack and llama_models:
```
llama_models == 0.0.41
…
```
-
I'm using torchtune for model quantization with QAT. I am currently working through https://pytorch.org/torchtune/main/tutorials/qat_finetune.html, but the results of the prepared_model I printed a…
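For comparison, a minimal sketch of the prepare/convert flow that tutorial is built around, assuming torchao's `Int8DynActInt4WeightQATQuantizer` (the exact import path varies across torchao/torchtune versions) and an already-instantiated `model`:
```
from torchao.quantization.prototype.qat import Int8DynActInt4WeightQATQuantizer

# Swap nn.Linear layers for fake-quantized equivalents; printing the model at
# this point shows the QAT wrapper modules instead of plain Linear layers
quantizer = Int8DynActInt4WeightQATQuantizer(groupsize=256)
prepared_model = quantizer.prepare(model)

# ... run the fine-tuning loop on prepared_model as usual ...

# Replace the fake-quantized layers with actually quantized ones
quantized_model = quantizer.convert(prepared_model)
```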
-
**Describe the bug**
I cannot quantize MobileNetV3 from Keras 2 because the hard-swish activation function is implemented as a TFOpLambda layer.
**System information**
tensorflow version: 2.17
tf_ke…
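A hypothetical minimal repro along these lines, assuming the legacy `tf_keras` (Keras 2) package with `TF_USE_LEGACY_KERAS=1` set and `tensorflow_model_optimization` installed; the model variant and arguments are placeholders:
```
import tf_keras as keras
import tensorflow_model_optimization as tfmot

# MobileNetV3 implements its hard-swish activation via TFOpLambda layers
base_model = keras.applications.MobileNetV3Small(weights=None)

# Expected to fail here: quantize_model cannot annotate the TFOpLambda layers
quantized_model = tfmot.quantization.keras.quantize_model(base_model)
```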
-
### What is the issue?
taozhiyu@Mac ~ % ollama run hf-mirror.com/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2-GGUF:Q8
pulling manifest
Error: pull model manifest: 400: The specified tag is not a v…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and f…
-
### 1. System information
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 22.04
- TensorFlow installation (pip package or built from source): pip package
- TensorFlow library (v…