-
I use the following code to convert the internal-state model to TFLite:
```
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset_gen
conv…
```
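For context, a full-integer post-training quantization setup typically looks like the sketch below; the SavedModel path, input shape, and the body of `representative_dataset_gen` are placeholder assumptions, not the poster's actual model or calibration data.
```python
# Sketch of full-integer PTQ. `saved_model_dir`, the input shape, and the
# calibration generator are placeholders, not the poster's actual setup.
import tensorflow as tf

def representative_dataset_gen():
    # Yield a handful of batches matching the model's input signature.
    for _ in range(100):
        yield [tf.random.normal([1, 128], dtype=tf.float32)]

converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset_gen
# Optionally force int8 kernels and int8 input/output tensors.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

tflite_model = converter.convert()
with open("model_int8.tflite", "wb") as f:
    f.write(tflite_model)
```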
-
TensorRT-LLM has great potential to let people run larger models efficiently on limited hardware. Unfortunately, the current quantization workflow requires significant computation…
-
I'd like to raise a concern about how quantization is currently handled in SpeechBrain. While training my own k-means quantizer on the last layer of an ASR model, I noticed that the interface was not …
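For readers unfamiliar with the setup, a k-means quantizer over last-layer features can be sketched with scikit-learn as below; this is a generic illustration, not SpeechBrain's interface, and `encoder`/`batches` are hypothetical placeholders.
```python
# Generic k-means quantizer over last-layer features (not SpeechBrain's API).
# `encoder` and `batches` are hypothetical: encoder(batch) -> [T, D] array.
import numpy as np
from sklearn.cluster import MiniBatchKMeans

def collect_features(encoder, batches):
    feats = [encoder(batch) for batch in batches]   # each [T, D]
    return np.concatenate(feats, axis=0)            # stacked [N, D]

def train_quantizer(features, n_units=512):
    km = MiniBatchKMeans(n_clusters=n_units, batch_size=1024)
    km.fit(features)
    return km

def quantize(km, features):
    # Map frame-level features to discrete unit IDs.
    return km.predict(features)
```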
-
%%capture
!pip install unsloth "xformers==0.0.28.post2"
# Also get the latest nightly Unsloth!
!pip uninstall unsloth -y && pip install --upgrade --no-cache-dir "unsloth[colab-new] @ git+https://gi…
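After installation, a typical Unsloth workflow loads a 4-bit model and attaches LoRA adapters; the sketch below follows Unsloth's documented `FastLanguageModel` API, with the checkpoint name and LoRA hyperparameters chosen as examples rather than taken from the post.
```python
# Sketch based on Unsloth's documented API; the checkpoint name and LoRA
# hyperparameters below are example choices, not taken from the post.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Llama-3.2-1B-Instruct-bnb-4bit",  # example checkpoint
    max_seq_length=2048,
    load_in_4bit=True,  # 4-bit weights so the model fits on a small Colab GPU
)

# Attach LoRA adapters so only a small fraction of parameters is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```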
-
https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct
https://github.com/vllm-project/llm-compressor/tree/main/examples/quantization_w8a8_fp8
https://github.com/vllm-project/llm-compressor/tre…
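The linked FP8 (W8A8) example boils down to a one-shot quantization pass along these lines; module paths and argument names may differ between llm-compressor releases, so treat this as a sketch rather than the exact example script.
```python
# Rough outline of the linked W8A8-FP8 example; module paths and arguments may
# differ between llm-compressor releases, so treat this as a sketch.
from transformers import AutoModelForCausalLM, AutoTokenizer
from llmcompressor.modifiers.quantization import QuantizationModifier
from llmcompressor.transformers import oneshot  # `from llmcompressor import oneshot` in newer releases

MODEL_ID = "meta-llama/Llama-3.1-8B-Instruct"
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto", device_map="auto")
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)

# Dynamic FP8 needs no calibration set: weights are quantized offline and
# activation scales are computed per token at runtime.
recipe = QuantizationModifier(targets="Linear", scheme="FP8_DYNAMIC", ignore=["lm_head"])
oneshot(model=model, recipe=recipe)

SAVE_DIR = "Llama-3.1-8B-Instruct-FP8-Dynamic"
model.save_pretrained(SAVE_DIR)
tokenizer.save_pretrained(SAVE_DIR)
```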
-
Hello authors,
Thank you for your excellent work.
I've tried utilizing AIMET to resolve a severe performance degradation issue caused by quantization while using the SNPE library. However, I've …
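For reference, AIMET's post-training quantization simulation is usually driven roughly as sketched below; the toy model, input shape, calibration data, and export paths are placeholder assumptions, and exact argument names may vary across AIMET releases.
```python
# Sketch of AIMET's QuantizationSimModel flow; the toy model, input shape,
# calibration data, and export paths are placeholders, and argument names may
# vary across AIMET releases.
import torch
from aimet_common.defs import QuantScheme
from aimet_torch.quantsim import QuantizationSimModel

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 224, 224)
calibration_batches = [torch.randn(1, 3, 224, 224) for _ in range(8)]

sim = QuantizationSimModel(
    model,
    dummy_input=dummy_input,
    quant_scheme=QuantScheme.post_training_tf_enhanced,
    default_param_bw=8,   # int8 weights
    default_output_bw=8,  # int8 activations
)

def calibrate(sim_model, batches):
    # Forward a few batches so AIMET can compute activation encodings.
    with torch.no_grad():
        for batch in batches:
            sim_model(batch)

sim.compute_encodings(forward_pass_callback=calibrate,
                      forward_pass_callback_args=calibration_batches)

# Export the model plus encodings, which SNPE's converter can consume.
sim.export(path="./export", filename_prefix="model_int8", dummy_input=dummy_input)
```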
-
Details of the image-generation code:
```python
from diffusers import FluxTransformer2DModel
import torch
def load_flux_model(
    model_path: str,
    load_from_file: bool = True,
    dtype: …
```
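The function above is cut off, so here is a separate, hypothetical sketch (`load_transformer` is my own placeholder name, not the poster's function) of the two loading paths its signature suggests, using diffusers' `from_single_file` and `from_pretrained` loaders.
```python
# Hypothetical sketch (my own `load_transformer`, not the poster's function)
# of the two loading paths the truncated signature suggests. The repo ID is an
# example value.
import torch
from diffusers import FluxTransformer2DModel

def load_transformer(model_path: str, load_from_file: bool, dtype=torch.bfloat16):
    if load_from_file:
        # A single .safetensors checkpoint exported outside the diffusers layout.
        return FluxTransformer2DModel.from_single_file(model_path, torch_dtype=dtype)
    # Standard diffusers layout: the transformer sits in the "transformer" subfolder.
    return FluxTransformer2DModel.from_pretrained(
        model_path, subfolder="transformer", torch_dtype=dtype
    )

transformer = load_transformer("black-forest-labs/FLUX.1-dev", load_from_file=False)
```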
-
### Search before asking
- [x] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
## 🐛 Bug
I'm looking at generating an int8 quantised PyTorch model (both weights and activations at int8) and exporting it to StableHLO via `torch-xla`'s `exported_program_to_stablehlo`.
Right no…
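The flow being described would look roughly like the sketch below: PT2E static int8 quantization of weights and activations, followed by lowering to StableHLO. The toy model, the choice of `XNNPACKQuantizer`, and the capture call are assumptions, and the capture/export API names shift between PyTorch releases.
```python
# Outline of the described flow: PT2E static int8 quantization (weights and
# activations) followed by lowering to StableHLO. The toy model, quantizer
# choice, and capture call are assumptions; capture/export APIs move between
# PyTorch releases.
import torch
from torch.ao.quantization.quantize_pt2e import prepare_pt2e, convert_pt2e
from torch.ao.quantization.quantizer.xnnpack_quantizer import (
    XNNPACKQuantizer,
    get_symmetric_quantization_config,
)
from torch_xla.stablehlo import exported_program_to_stablehlo

model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()
example_inputs = (torch.randn(1, 16),)

# Capture an FX graph (export_for_training on recent PyTorch; older versions
# used capture_pre_autograd_graph instead).
captured = torch.export.export_for_training(model, example_inputs).module()

quantizer = XNNPACKQuantizer().set_global(get_symmetric_quantization_config())
prepared = prepare_pt2e(captured, quantizer)
prepared(*example_inputs)            # calibration pass
quantized = convert_pt2e(prepared)

# Re-export the quantized graph and lower it to StableHLO.
exported = torch.export.export(quantized, example_inputs)
stablehlo_program = exported_program_to_stablehlo(exported)
print(stablehlo_program.get_stablehlo_text())
```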
-
### Description of the bug:
I tried running the example.py script provided for the quantization example, but for Llama. Wherever Gemma was referenced, I made the corresponding changes for Llama. The…