decoding-comp-trust/comp-trust
Codebase for decoding compressed trust.
MIT License
20 stars
0 forks
issues
Inference is extremely slow with llama2-13b and llama2-13b-chat
#3
coolknow
opened
2 months ago
1
How did you compress the model using AWQ for 3-bit and 8-bit?
#2
coolknow
closed
3 months ago
4
Sorry, where is "conversation"?
#1
JohnneyQin
closed
6 months ago
1