sparsegpt Search Results

68 results
for sparsegpt

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

IST-DASLab/sparsegpt #6

Hessian Inverse

Hello, SparseGPT is really an amazing job. I am wondering in `https://github.com/IST-DASLab/sparsegpt/blob/master/sparsegpt.py#L77`, why cholesky decomposition is performed on the inversed hess…

Calmepro777 updated 1 year ago
1
microsoft/Olive #782

[Bug]: AssertionError: No valid accelerator specified for ta…

### What happened? the error happens, when I run "python -m olive.workflows.run --config bert_cuda_gpu.json". What should I do? ### Version? main

Arthur-Ling updated 11 months ago
4
dilab-zju/self-speculative-decoding #7

Proposal: Evaluating Faster, Deterministic Alternatives to B…

Bayesian optimization can sometimes be very time-consuming, especially for models of significant size. I am interested in exploring how it compares in efficiency with faster methods. Recent advance…

azurespace updated 1 year ago
1
locuslab/wanda #27

Questions about sub-networks of LLMs

Hello~. I am reading your paper and notice that you have mentioned lots of times that "exact and effective sparse sub-networks exist for LLMs". But I am a little confused and do not get it, your pruni…

JiwenJ updated 11 months ago
2
IST-DASLab/sparsegpt #22

Adaptation for Pruning Conv2d or Conv3d Layers?

How would I proceed to adapt the "add_batch" function to make the pruning possible on a Conv layer? Am I missing something here. Any suggestions are greatly appreciated. Thanks in advance.

satabios updated 1 year ago
1
locuslab/wanda #15

How to use sparseGPT to prune the output dimension?

How to use sparseGPT to prune the output dimension? When I was calculating the Hessian matrix, the input dimension did not match the Hessian matrix dimension

wfan1203 updated 1 year ago
3
locuslab/wanda #5

"line 39, in main parser.add_argument('--save_model', **…

little typo there, would putting my own dataset into here made a difference? ive been having an itch since i heard about sparsegpt to see how close i can tool it to task orient a model

Alignment-Lab-AI updated 1 year ago
3
locuslab/wanda #22

calibration data seq_length

Hi, I have a question about calibration data (128, 2048 tokens, respectively) Is there a particular reason to use 2048 tokens for each data? I tracked [SparseGPT](https://arxiv.org/pdf/2301.00774.…

kiucho updated 1 year ago
1
IST-DASLab/sparsegpt #12

Different error between OBS and SparseGPT

Following OBS, we want to remove the weights with minimum error $w_m^2/H_{mm}^{-1}$ . But, in the SparseGPT algorithm, we use $w_m^2/{H^{-1}}_{mm}^2$ instead. I'm not sure if it is equivalent betwe…

sbwww updated 1 year ago
5
neuralmagic/deepsparse #1076

I see that SparseGPT has been integrated into your project. …

**Describe the bug** A clear and concise description of what the bug is. **Expected behavior** A clear and concise description of what you expected to happen. **Environment** Include all rele…

18140663659 updated 1 year ago
3

上一页 1...1 2 3 4 5 6 7...7 下一页

68 results for sparsegpt

68 results
for sparsegpt