-
Hello,
SparseGPT is really an amazing job.
I am wondering in `https://github.com/IST-DASLab/sparsegpt/blob/master/sparsegpt.py#L77`, why cholesky decomposition is performed on the inversed hess…
-
### What happened?
the error happens, when I run "python -m olive.workflows.run --config bert_cuda_gpu.json". What should I do?
### Version?
main
-
Hello~. I am reading your paper and notice that you have mentioned lots of times that "exact and effective sparse sub-networks exist for LLMs". But I am a little confused and do not get it, your pruni…
-
Bayesian optimization can sometimes be very time-consuming, especially for models of significant size. I am interested in exploring how it compares in efficiency with faster methods.
Recent advance…
-
How would I proceed to adapt the "add_batch" function to make the pruning possible on a Conv layer? Am I missing something here.
Any suggestions are greatly appreciated. Thanks in advance.
-
How to use sparseGPT to prune the output dimension? When I was calculating the Hessian matrix, the input dimension did not match the Hessian matrix dimension
-
little typo there, would putting my own dataset into here made a difference? ive been having an itch since i heard about sparsegpt to see how close i can tool it to task orient a model
-
Hi, I have a question about calibration data (128, 2048 tokens, respectively)
Is there a particular reason to use 2048 tokens for each data?
I tracked [SparseGPT](https://arxiv.org/pdf/2301.00774.…
-
Following OBS, we want to remove the weights with minimum error $w_m^2/H_{mm}^{-1}$ . But, in the SparseGPT algorithm, we use $w_m^2/{H^{-1}}_{mm}^2$ instead.
I'm not sure if it is equivalent betwe…
-
**Describe the bug**
A clear and concise description of what the bug is.
**Expected behavior**
A clear and concise description of what you expected to happen.
**Environment**
Include all rele…