gqa Search Results - Githubissues

intel/intel-extension-for-pytorch #696

torch.xpu for GQA

### Describe the issue Hi I come from https://github.com/vllm-project/vllm/issues/6701. I am wondering when will the 2.3.110 IPEX be released.

liuxingbin updated 1 week ago

OpenGVLab/InternVL #576

[Docs] Can't evaluate GQA testdev.

### 📚 The doc issue I don't think it's possible to get the structure of the dataset as depicted below in the diagram as shown in the diagram. ### Suggest a potential alternative/fix I don't k…

Zzzzzzzzzzj updated 1 day ago

horseee/LLM-Pruner #64

Adaptation of GQA

Thank you for your solid work. I would like to ask if the current version is suitable for GQA architecture models, such as LLaMA-2-70B and LLaMA-3.

junzhang-zj updated 1 month ago

mit-han-lab/Quest #8

Support for bsz>1 and GQA

Great job! We found that Quest is implemented on the previous version of flashinfer and some common feature are not support currently. * bsz > 1 * GQA * CUDA graph Is there any plan to update t…

Ryanuppp updated 1 week ago

bknyaz/sgg #11

GQA-SGCls-1 checkpoint link is not working

Great job! I found that the checkpoint link about GQA is not working, e.g. "GQA-SGCls-1 checkpoint". Could you please re-upload the "GQA-SGCls-1 checkpoint"? Thanks again for the work you do!

hujunming0625 updated 1 day ago

microsoft/onnxruntime-genai #880

[feature request] builder to expose {GQA, MHA} selection as …

Currently these are inferred from the combination of other configurations such as device and dtype. It is more flexible for downstream users if this can be selected by choice.

BowenBao updated 1 week ago

fkodom/grouped-query-attention-pytorch #4

gqa model runtime > no gqa model runtime

Hi!, I'm trying to replicate your implementation with Llama 2-13B and 7B, but curiously the runtimes didn't make sense (llama 2 gqa > llama 2 WITHOUT gqa) there is a little difference between my code …

Adonai02 updated 3 months ago

dongxingning/SHA-GCL-for-SGG #15

one drive link for GQA dataset and GQA pretrained object det…

I cannot find any files in the GQA dataset split link and GQA pretrained object detector OneDrive link. Can you check please? Thank you.

JeonJaeHyeong updated 2 weeks ago

cientgu/InstructDiffusion #21

What is meta_info.json for GPA gqa-inplant dataset not found

YerongLi updated 3 weeks ago

FasterDecoding/SnapKV #16

Question on GQA implementation

In GQA, only one copy of kv cache will be saved for each group, but snapKV saves kv cache with `num_key_value_heads * num_key_value_groups` heads. Indeed in kv cache eviction, the choice might be diff…

cyLi-Tiger updated 3 months ago

1000+ results for gqa

1000+ results
for gqa