-
SpinQuant: https://arxiv.org/abs/2405.16406
-
Background:
The [SpinQuant paper](https://arxiv.org/pdf/2405.16406) introduces a method of improving quantization by adding rotation matrices to the model weights that improve quantizatio…
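To make the idea concrete, here is a minimal PyTorch sketch (not the paper's actual code) of why folding an orthogonal rotation into a weight matrix leaves the full-precision output unchanged while changing the values the quantizer sees. The sizes, the random `R`, and the linear-layer convention are illustrative assumptions, not taken from the paper.

```python
import torch

torch.manual_seed(0)
hidden = 64                                  # hypothetical hidden size
W = torch.randn(hidden, hidden)              # stand-in for a linear weight (out x in)
x = torch.randn(4, hidden)                   # stand-in activations

# Random orthogonal matrix from a QR decomposition (SpinQuant learns/chooses better ones).
R, _ = torch.linalg.qr(torch.randn(hidden, hidden))

y_ref = x @ W.T                              # original layer output
y_rot = (x @ R) @ (W @ R).T                  # rotation folded into both weight and input

# R is orthogonal, so R @ R.T = I and the outputs match up to floating-point error.
assert torch.allclose(y_ref, y_rot, atol=1e-4)

# Quantization is then applied to the rotated weight (W @ R), whose value
# distribution is typically better behaved (fewer outliers) than the raw W.
```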
-
### 🐛 Describe the bug
Currently I'm trying to test the LLaMA 3.2 1B Instruct model as you guided.
I have already tested LLaMA 2 7B / LLaMA 3 8B with XNNPACK on the device side.
I faced some issues du…
-
Looks like the README file has the description and commands to run SpinQuant, not LLM QAT.
-
Hi, I appreciate your work! I have a question regarding the training cost. In the introduction it's mentioned that the training cost of LLaMA-2 7B is 1.3 hours on a single A100, but section 4.1 mentio…
-
When using the right hidden size for the rotation, the Llama 3 8B model performs better:
WIKITEXT2 PPL improves from 11.544 to 8.967.
But for the other models, while running the fake quant:
`pyt…
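As a point of reference for what "the right hidden size" means here, below is a hedged sketch: the rotation matrix has to be square with the model's hidden dimension, otherwise the matmul against the weights during fake quantization has mismatched shapes. The helper name and the hard-coded sizes (4096 for Llama 3 8B, 2048 for Llama 3.2 1B, per their configs) are illustrative, not taken from the report.

```python
import torch

def random_rotation(hidden_size: int) -> torch.Tensor:
    """Return a random orthogonal rotation sized to the model's hidden dimension."""
    q, _ = torch.linalg.qr(torch.randn(hidden_size, hidden_size))
    return q

# Illustrative hidden sizes; in practice they come from the model config
# (config.hidden_size), e.g. 4096 for Llama 3 8B and 2048 for Llama 3.2 1B.
for hidden_size in (4096, 2048):
    R = random_rotation(hidden_size)
    W = torch.randn(hidden_size, hidden_size)   # stand-in weight of matching width
    _ = W @ R                                    # shapes line up only when sizes match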
-
## Problem
We don't publish aarch64 Linux binaries, so right now we still install ao==0.1.
```
(myvenv) marksaroufim@rpi5:~/Dev/ao $ pip install torchao
Looking in indexes: https://pypi.org/simpl…
```
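A quick, hedged way to confirm what pip actually resolved on such a machine; this sketch only uses the standard library, and the expected outputs in the comments just restate the report above.

```python
import platform
from importlib.metadata import version

# On the Raspberry Pi 5 above this prints 'aarch64'.
print(platform.machine())
# With no aarch64 wheel published, pip falls back to the old 0.1 release.
print(version("torchao"))
```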
-
Model: Llama 3.2 1B (without any quantization)
I ran the app, loaded the model, and entered the input, but the following error appears in the middle of the output.
```shell
2024-10-16 17:11:17.7…
```