-
Great work going on with GGML. Bravo to so many contributors. You are champions!
Maybe more performance (on CPU) can be had by bringing sparsity into the workflow. Here is one of the many efforts…
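If it helps make the idea concrete, here is a minimal Python sketch (not GGML code; the sizes and the 50% density are illustrative assumptions) of why a sparse format pays off on CPU: a CSR matrix-vector product only touches stored nonzeros, so work scales with density rather than with the full weight matrix.
```python
import numpy as np
from scipy.sparse import csr_matrix

rng = np.random.default_rng(0)
W = rng.standard_normal((1024, 1024)).astype(np.float32)
W[rng.random(W.shape) < 0.5] = 0.0     # prune ~50% of the weights (illustrative)

W_sparse = csr_matrix(W)               # stores only the nonzeros
x = rng.standard_normal(1024).astype(np.float32)

# The sparse matvec does work proportional to the nonzero count,
# which is where a sparse CPU inference path could save time.
y_sparse = W_sparse @ x
y_dense = W @ x
assert np.allclose(y_sparse, y_dense, atol=1e-2)
```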
-
Hello, I have tried lots of different version combinations to make the LLaMA script work, but it produces very bad results, which is
also what I observed with my own implementation and some other implemen…
-
Hi @Eric-mingjie,
I am also facing the same issue (as [#51]) when trying to prune llama-2-7b-chat-hf.
Here's the command:
`python main.py --model meta-llama/Llama-2-7b-chat-hf --prune_method…
-
Hi! Thanks for your great work!
I'm a little confused about the implementation. Your simple and efficient method only requires a single forward pass to compute the activations of each layer. [This line](ht…
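For reference, a minimal sketch of that single-pass idea (not the repository's code; `collect_activation_norms` and the calibration batch are hypothetical names) using PyTorch forward hooks to gather per-channel input activation norms for every linear layer in one forward pass:
```python
import torch

def collect_activation_norms(model, calib_batch):
    """Hypothetical helper: gather per-input-channel L2 norms of each
    nn.Linear layer's input in a single forward pass, via forward hooks."""
    norms, handles = {}, []

    def make_hook(name):
        def hook(module, inputs, output):
            # inputs[0]: (..., in_features) -> flatten to (tokens, in_features)
            x = inputs[0].detach().reshape(-1, inputs[0].shape[-1]).float()
            norms[name] = norms.get(name, 0) + (x ** 2).sum(dim=0)
        return hook

    for name, module in model.named_modules():
        if isinstance(module, torch.nn.Linear):
            handles.append(module.register_forward_hook(make_hook(name)))

    with torch.no_grad():
        model(calib_batch)           # the single forward pass

    for h in handles:
        h.remove()
    return {name: s.sqrt() for name, s in norms.items()}
```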
-
When I try SparseGPT, it raises the following error:
Traceback (most recent call last):
  E:\pythonProject\pruning.py:77 in …
-
Hi,
I was wondering if you plan to release the sparsified Llama2 models publicly. In particular, I am interested in Llama2-70B with 50% unstructured sparsity.
Thanks!
-
With the increasing interest in using this library to train models originally trained by others (https://github.com/EleutherAI/gpt-neox/issues/896 https://github.com/EleutherAI/gpt-neox/issues/994 htt…
-
# Repo links
https://github.com/THUDM/ChatGLM-6B
https://github.com/mymusise/ChatGLM-Tuning
https://github.com/LianjiaTech/BELLE
## LLM quantization
https://zhuanlan.zhihu.com/p/616969812
- [SmoothQuant](htt…
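For readers following the SmoothQuant link above, a minimal NumPy sketch of its core idea (the function name and shapes are illustrative assumptions, not the paper's code): pick a per-channel scale s_j = max|X_j|^alpha / max|W_j|^(1-alpha), divide activations by s and multiply weights by s, which migrates quantization difficulty from activations to weights while leaving the layer output unchanged.
```python
import numpy as np

def smoothquant_scale(X, W, alpha=0.5):
    """Hypothetical illustration of SmoothQuant-style scale migration.
    X: (tokens, in_features) activations; W: (out_features, in_features) weights."""
    act_max = np.abs(X).max(axis=0)                 # per-input-channel activation range
    w_max = np.abs(W).max(axis=0)                   # per-input-channel weight range
    s = act_max ** alpha / np.maximum(w_max, 1e-5) ** (1 - alpha)
    s = np.maximum(s, 1e-5)                         # guard against zero scales
    # Y = (X / s) @ (W * s).T equals X @ W.T, so the output is unchanged;
    # the scaled activations have a flatter range and quantize more easily.
    return X / s, W * s

# Quick equivalence check:
rng = np.random.default_rng(0)
X, W = rng.standard_normal((8, 16)), rng.standard_normal((4, 16))
Xs, Ws = smoothquant_scale(X, W)
assert np.allclose(Xs @ Ws.T, X @ W.T)
```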
-
Is it possible to use this with Llama 2? I'm interested in improving the inference speed, so the accuracy loss doesn't matter right now.