-
First, I want to say THANK YOU for making this project possible. It's amazing how many possibilities will open up thanks to this community :)
I want to run llama2 on my iPhone; however, most iPhones…
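The question is cut off above, but the usual route to fitting Llama 2 into phone-class memory is 4-bit quantization via llama.cpp/GGUF. A minimal sketch using llama-cpp-python (the GGUF file name is a placeholder, and on an actual iPhone you would use an app built on llama.cpp rather than Python):

```python
from llama_cpp import Llama

# Load a 4-bit (Q4_K_M) GGUF quantization of Llama 2; the file name is a
# placeholder for whichever quantized checkpoint you produce or download.
llm = Llama(model_path="llama-2-7b.Q4_K_M.gguf", n_ctx=2048)
out = llm("Hello, ", max_tokens=16)
print(out["choices"][0]["text"])
```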
-
What quantization algorithm was used in the unsloth/Llama-3.2-1B-bnb-4bit model: https://huggingface.co/docs/transformers/main/en/quantization/overview. Is it int4_awq or int4_weightonly?
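For what it's worth, the "bnb-4bit" suffix conventionally indicates bitsandbytes 4-bit quantization (typically NF4), rather than AWQ or a plain int4 weight-only scheme. One way to confirm is to read the quantization config stored in the checkpoint itself; a minimal sketch:

```python
from transformers import AutoConfig

# The quantization method is recorded in the checkpoint's config.json;
# for a bitsandbytes checkpoint this typically reports quant_method
# "bitsandbytes" along with bnb_4bit_quant_type (e.g. "nf4").
config = AutoConfig.from_pretrained("unsloth/Llama-3.2-1B-bnb-4bit")
print(config.quantization_config)
```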
-
### System Info
Transformers.js 3.0.1
running in Node 18 using CommonJS
### Environment/Platform
- [ ] Website/web-app
- [ ] Browser extension
- [X] Server-side (e.g., Node.js, Deno, Bun)
- [ ] De…
-
Currently, there is no "unknown" quantization option for the OpenRouter provider, so models like mistralai/mixtral-8x7b, whose quantization is not reported, do not work.
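For context, here is a hedged sketch of what a quantization-filtered OpenRouter request looks like; the `quantizations` provider preference and the `"unknown"` value are assumptions based on OpenRouter's provider-routing options, not a confirmed fix:

```python
import requests

# Sketch of a chat request that restricts routing by quantization level.
# If a model's quantization is unreported, an "unknown" entry would be
# needed here for it to match at all (assumption, per the issue above).
resp = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": "Bearer <OPENROUTER_API_KEY>"},
    json={
        "model": "mistralai/mixtral-8x7b",
        "messages": [{"role": "user", "content": "Hello"}],
        "provider": {"quantizations": ["unknown"]},
    },
)
print(resp.json())
```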
-
Would it be possible to support 8-bit quantization?
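The project isn't named in this snippet, but in a Hugging Face Transformers-style stack, 8-bit support usually means loading weights through bitsandbytes; a minimal sketch of what that looks like (the model id is a placeholder):

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Load weights in 8-bit via bitsandbytes (LLM.int8); requires a CUDA GPU.
bnb_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
```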
-
### Checklist
- [x] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.…
-
### Description of the bug:
I tried running the example.py script given as the quantization example, but for Llama. Wherever a reference to Gemma was made, I made the appropriate reference to Llama. The…
-
Does MiniCPM-V 2.6 currently support int8/fp8 quantization?
thanks~
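One way to probe this empirically is to attempt an fp8 load in vLLM; whether MiniCPM-V 2.6 is accepted depends on the backend version, so treat this as a sketch rather than confirmation of support:

```python
from vllm import LLM

# Attempt an fp8-quantized load of MiniCPM-V 2.6; if the architecture or
# quantization mode is unsupported, this raises at initialization.
llm = LLM(
    model="openbmb/MiniCPM-V-2_6",
    quantization="fp8",
    trust_remote_code=True,
)
```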
-
## Description
TensorRT 10.5's pytorch-quantization has a compile bug.
https://github.com/NVIDIA/TensorRT/blob/release/10.5/tools/pytorch-quantization/src/tensor_quant_gpu.cu#L28-L37
It defines two macros `AT_DI…
-
We want to support the ability to run a full fine-tune with just 8-bit quantization.
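One common reading of "full fine-tune with 8-bit quantization" is to keep the trainable weights in bf16 while holding optimizer state in 8 bits via bitsandbytes, since weights frozen as int8 cannot be fully fine-tuned directly; a sketch under that assumption (the model id is a placeholder):

```python
import bitsandbytes as bnb
import torch
from transformers import AutoModelForCausalLM

# Full fine-tune with 8-bit optimizer state: every weight stays trainable
# in bf16, while the Adam moments are stored in 8 bits to cut memory.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder model id
    torch_dtype=torch.bfloat16,
)
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-5)
```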