dvlab-research Q-LLM issues - Githubissues

dvlab-research / Q-LLM

This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"

https://arxiv.org/abs/2406.07528

38 stars 1 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

benchmark/download.py is missing

#3 zetian1025 closed 1 month ago
1
AttributeError: 'RotaryEmbeddingESM' object has no attribute 'shape'

#2 pengshuang opened 3 months ago
1
question_ids如何设置？

#1 MrXiaoaa opened 3 months ago
2