gqa Search Results - Githubissues

1000+ results
for gqa

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

meta-llama/llama #635

GQA for smaller models

Hello, could we please have 13b and 7b models with the updated architecture that includes grouped query attention? A lot of people are running these models on machines with low memory and this woul…

Dampfinchen updated 2 months ago
1
ashkamath/mdetr #47

KeyError: 'gqa_accuracy_answer_total_unscaled'

This mistake is really strange... I follow [the readme](https://github.com/ashkamath/mdetr/blob/main/.github/clevr.md) for training MDETR on CLEVR. Firstly, I've ran the following command: ``` pyth…

TopCoder2K updated 2 months ago
3
google/jax #22620

Relax shape constraint in `dot_product_attention` to allow M…

Many modern architectures use either GQA or MQA rather than MHA, but `dot_product_attention` allows only MHA by enforcing `query`, `key` and `value` should have the same number of heads: https://gi…

monatis updated 1 month ago
1
ronilp/mac-network-pytorch-gqa #5

GQA Accuracy

Could you please tell me about the accuracy of the model under the GQA task？ I only reached 45%

HellwayXue updated 3 years ago
3
huggingface/transformers #28425

GQA Llama 13B slower than Llama 13B without GQA

### Feature request It would be nice if when I choose different key_value_heads (key_value_heads < attention_heads) on config's model, automatically the attn weights were computed by mean pooling. …

Adonai02 updated 8 months ago
2
rlqja1107/torch-LLM4SGG #8

TypeError: __init__() missing 4 required positional argument…

GQA dataset test

hanzefang updated 2 months ago
1
KaihuaTang/Scene-Graph-Benchmark.pytorch #19

GQA datasets support?

## ❓ Questions and Help In your configs, I saw there exist difference between VG and GQA. But I cannot find the support for the GQA dataset.So any ideas about the GQA support?

runzeer updated 2 years ago
4
airsplay/lxmert #52

GQA submission

I generated the `submit_predict.json` and submited it to GQA evaluation server. However, I got an accuracy of 0 in test phase, but the result in dev phase makes sense. Is it possible that I predict al…

zaynmi updated 4 years ago
18
SHI-Labs/VCoder #6

Error when evaluating on GQA dataset

Hello, I was attempting to evaluate the model on the GQA dataset by following the instructions provided in the [Getting Started guide](https://github.com/SHI-Labs/VCoder/blob/main/docs/Getting_Star…

qncsn2016 updated 5 months ago
1
AILab-CVC/YOLO-World #422

Finetune question

Dear author, now if I want to add a GQA dataset for training, what do I need to do exactly?

zhujiajian98 updated 2 months ago
3

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for gqa

1000+ results
for gqa