dvlab-research/Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
https://arxiv.org/abs/2406.07528
38 stars, 1 fork
issues
benchmark/download.py is missing
#3
zetian1025
closed
1 month ago
1
AttributeError: 'RotaryEmbeddingESM' object has no attribute 'shape'
#2
pengshuang
opened
3 months ago
1
How should question_ids be set?
#1
MrXiaoaa
opened
3 months ago
2