-
Hi,
Thanks very much for your work and for publishing your code. I am currently working on integrating SpinQuant into [torch/ao](https://github.com/pytorch/ao/pull/983/), and I would like to cla…
-
I recently tried to save the model weights after applying R @ Weight as produced by SpinQuant, and found that quantization accuracy improved very little when hadamard_utils.get_hadK could not find a Hadamard matrix of the appropriate size…
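As context for the discussion above, the identity that makes folding R @ Weight into the saved checkpoint safe is that an orthogonal rotation applied to the weight is cancelled by the inverse rotation applied to the input. A minimal sketch, using a random orthogonal matrix as a stand-in for SpinQuant's learned rotation (the `random_orthogonal` and `merge_rotation` helpers are hypothetical names, not part of the SpinQuant code):

```python
import numpy as np

def random_orthogonal(n, seed=0):
    # Stand-in for SpinQuant's learned rotation R: the QR decomposition
    # of a Gaussian matrix yields an orthogonal Q (Q @ Q.T == I).
    rng = np.random.default_rng(seed)
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q

def merge_rotation(weight, R):
    # Fold the rotation into the weight so only W @ R needs to be saved:
    # (W @ R) @ (R.T @ x) == W @ x for orthogonal R.
    return weight @ R

d_in, d_out = 64, 32
W = np.random.default_rng(1).standard_normal((d_out, d_in))
R = random_orthogonal(d_in)
x = np.random.default_rng(2).standard_normal(d_in)

y_ref = W @ x                                  # original layer output
y_rot = merge_rotation(W, R) @ (R.T @ x)       # rotated-then-unrotated path
assert np.allclose(y_ref, y_rot)
```

The accuracy benefit, however, comes only from quantizing the rotated weight; if `get_hadK` cannot supply a Hadamard of the right size, the rotation degenerates and the outlier suppression is lost, which matches the symptom described above.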
-
### 🐛 Describe the bug
Currently I'm trying to test the LLaMA 3.2 3B Instruct model as you guided.
However, I faced some issues during .pte generation for the LLaMA 3.2 3B Instruct model with QNN @ On Device sid…
-
### 🐛 Describe the bug
After the patch in https://github.com/pytorch/executorch/issues/6284#issuecomment-2423431020 fixed the original invalid UTF-8 character issue, there is a new issue with the tensor type …
-
### Right Case
When I follow the doc https://github.com/pytorch/executorch/blob/main/examples/models/llama/README.md#enablement,
I export the Llama3.2-1B-Instruct:int4-spinquant-eo8 model to xnnpa…
-
A new, interesting quantization scheme was published, which not only reduces memory consumption (like current quantization schemes) but also reduces computation.
> **[QuaRot: Outlier-Free 4-Bit In…
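The core idea behind QuaRot is that multiplying activations by an orthogonal Hadamard matrix spreads the energy of outlier channels across all coordinates, which shrinks the peak-to-RMS ratio that makes 4-bit quantization lossy. A minimal sketch of that effect (the Sylvester construction and injected outlier are illustrative assumptions, not QuaRot's actual kernels):

```python
import numpy as np

def hadamard(n):
    # Sylvester construction; n must be a power of two.
    H = np.array([[1.0]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H / np.sqrt(n)  # scale so H is orthonormal (H @ H.T == I)

rng = np.random.default_rng(0)
x = rng.standard_normal(256)
x[7] = 40.0  # inject an activation outlier in one channel

H = hadamard(256)
x_rot = H @ x

# Peak-to-RMS ratio: high values force a wide quantization range
# that wastes the few bits available at 4-bit precision.
ratio = lambda v: np.max(np.abs(v)) / np.sqrt(np.mean(v**2))
print(ratio(x), ratio(x_rot))  # the rotated vector has a much smaller ratio
```

Because the rotation is orthogonal it can be folded into adjacent weight matrices, so the outlier suppression comes at essentially no inference cost.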
-
### 📚 The doc issue
I use this command to transform the model (Llama-3.2-1B):
```
python -m examples.models.llama.export_llama --checkpoint "${MODEL_DIR}/consolidated.00.pth" -p "${MODEL_DIR}/params.json" -…
```
-
I am following the [instructions in the Llama2 README](https://github.com/pytorch/executorch/blob/d9aeca556566104c2594ec482a673b9ec5b11390/examples/models/llama2/README.md#instructions) to test llama m…
-
Hello, everyone!
Thank you for your contributions to quantization work!
I would like to discuss some issues I encountered while reproducing the results of the SpinQuant paper.
I use the code fro…