-
Hi,
I am trying to prune Mistral 7B (https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2). I was able to run the magnitude-pruning commands successfully, but I am facing issues with…
-
Hi! If I do not want to quantize and only want to perform structured pruning, is it okay to set quantize: false in the recipe as below and omit the QuantizationModifier?
SparseGPTModif…
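For reference, a hypothetical recipe fragment along these lines. The stage and field names here are an assumption modeled on SparseML-style YAML recipes and are not verified against this repo's schema, so treat it as a sketch of the idea (pruning modifier present, quantization disabled), not a working config:

```yaml
# hypothetical sketch: structured pruning only, quantization disabled
test_stage:
  pruning_modifiers:
    SparseGPTModifier:
      sparsity: 0.5
      mask_structure: "2:4"     # structured N:M sparsity (assumed field name)
      quantize: false           # the flag the question asks about
  # no QuantizationModifier listed anywhere in the recipe
```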
-
Hi, and thanks for the amazing repo.
I have a bit of a tall request. SparseGPT uses a per-layer optimal brain surgeon (OBS) approach to pruning. Here is the [pytorch code](https://github.com/IST-DASLab/spar…
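For readers unfamiliar with the per-layer OBS rule mentioned above, here is a minimal NumPy sketch of the idea on a single weight row: repeatedly zero the weight with the smallest saliency w_q² / [H⁻¹]_qq and update the surviving weights to compensate. The function name `obs_prune_row` and the dense inverse-Hessian input are assumptions for illustration; the real SparseGPT code linked in the request works column-blockwise on GPU for efficiency.

```python
import numpy as np

def obs_prune_row(w, H_inv, num_prune):
    """OBS-style pruning of one weight row.

    w:      1-D weight vector
    H_inv:  inverse of the (layer-wise) Hessian, same dimension as w
    Returns the updated weights and a keep-mask.
    """
    w = w.astype(float).copy()
    H_inv = H_inv.copy()
    mask = np.ones_like(w, dtype=bool)
    for _ in range(num_prune):
        # saliency of removing each still-alive weight
        saliency = np.where(mask, w**2 / np.diag(H_inv), np.inf)
        q = int(np.argmin(saliency))
        # compensate the remaining weights for removing w_q
        w -= w[q] / H_inv[q, q] * H_inv[:, q]
        w[q] = 0.0
        mask[q] = False
        # eliminate row/column q from H_inv so later steps ignore it
        H_inv -= np.outer(H_inv[:, q], H_inv[q, :]) / H_inv[q, q]
        H_inv[q, q] = 1.0  # placeholder to avoid division by zero
    return w, mask
```

With an identity Hessian the update term vanishes and this degenerates to plain magnitude pruning, which is a handy sanity check.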
-
Sparsity should reduce the model size and increase inference speed without hurting performance too much. This repo https://github.com/IST-DASLab/sparsegpt is Apache-licensed and may be useful (I hope).
-
When will the tool be ready for use?
-
-
```
(textgen) [root@pve-m7330 sparsegpt]# python llama.py ../text-generation-webui/models/TinyLlama-1.1B-Chat-v1.0/ wikitext2 --nsamples 10
Token indices sequence length is longer than the specified…
```
-
**Describe the bug**
A clear and concise description of what the bug is.
**Hardware details**
Information about CPU and GPU, such as RAM, number, etc.
**Software version**
Version of relevant…
-
-
Hello, I find running SparseGPT extremely slow with tensor parallelism tp=1 and pipeline parallelism pp=1. Would larger values help? Thank you!