issues
search
opengear-project
/
GEAR
GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM
MIT License
116
stars
7
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Evaluation Code Produces Identical Results with Different Caching Methods
#17
mohsenhariri
opened
5 days ago
1
proper unicode added
#16
mohsenhariri
closed
4 days ago
0
UnicodeEncodeError when writing non-ASCII characters to a file
#15
mohsenhariri
closed
4 days ago
1
Questions about zero-shot
#14
YcChou
closed
16 hours ago
1
Where is the outlier extraction logic in `cuda_support_gear`
#13
Ther-nullptr
closed
16 hours ago
1
How to reproduce GEAR on Mistral models
#12
CUHKSZzxy
closed
16 hours ago
2
Question about LowRank
#11
shhn1
opened
4 weeks ago
2
Questions about the code structure
#10
CUHKSZzxy
closed
16 hours ago
3
Question about the shell commands details on how to reproduce the main results of COT and zeroshot performance
#9
zoominguniverse
closed
1 week ago
2
Can't reproduce the benchmarks
#8
cyLi-Tiger
closed
1 month ago
2
Qustion about storage
#7
mlxht990720
opened
2 months ago
2
How to eval GEAR with lm-eval framework?
#6
ThisisBillhe
closed
1 week ago
2
[Bug] maybe a bug in fake_quant_error_simulation function
#5
HarryWu99
closed
2 months ago
1
questions about GenerationTest folder
#4
hzfengfengxia
closed
2 months ago
2
questions about rapids folder
#3
hzfengfengxia
closed
3 months ago
2
Question about storage overhead of sparse matrix S
#2
ThisisBillhe
closed
2 months ago
3
Public cod Refined
#1
HaoKang-Timmy
closed
4 months ago
0