-
Hello, Mistral Team!
Congrats on open-sourcing your model, and thanks a lot for your work! Inspired by its memory and compute efficiency and benchmark performance, I tried to re…
-
Really grateful for your work. I have a question about Table 2 in your experiments: are the last two columns using CogVLM and BLIP-2 directly for the VQA task? If so, `cogvlm vqa` can get the best result…
-
In [modify_llama.py](https://github.com/FMInference/H2O/blob/main/h2o_hf/utils_real_drop/modify_llama.py), the `hh_score` of `H2OCache` is computed by `attn_scores.sum(0).sum(1)`, resulting in a shape of [n…
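
For readers skimming the issue, here is a minimal sketch of the summation being described, with assumed shapes (illustrative dimensions only, not the actual H2O implementation):

```python
import torch

# Assumed attention-weight layout: [batch, num_heads, q_len, kv_len].
bsz, num_heads, q_len, kv_len = 1, 32, 16, 128
attn_scores = torch.softmax(torch.randn(bsz, num_heads, q_len, kv_len), dim=-1)

# Summing over the batch dim (0), then over the query dim (1 after the
# first sum), leaves one accumulated score per head per cached key.
hh_score = attn_scores.sum(0).sum(1)
print(hh_score.shape)  # torch.Size([32, 128]) -> [num_heads, kv_len]
```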
-
* Name of dataset: General Question Answering (GQA)
* URL of dataset: https://cs.stanford.edu/people/dorarad/gqa/
* License of dataset: https://creativecommons.org/licenses/by/4.0/
* Short descri…
-
Hello, first of all, thank you very much for providing the code. I have two questions:
1. Could you please let me know if the code mentioned in the description for generating annotations for other d…
-
Hi, have you tried implementing the Chai structure on models like LLaMA2 or any other models besides LLaMA and OPT? Looking forward to your response, thanks.
-
Dear author, if I now want to add the GQA dataset for training, what exactly do I need to do?
-
https://github.com/QwenLM/Qwen2/issues/259
The issue found in Qwen1.5 still exists in Qwen2; the affected models all appear to use GQA.
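
For context, GQA here means grouped-query attention, where each key/value head is shared by a group of query heads; this is the mechanism the affected models have in common. A minimal sketch with assumed dimensions (not Qwen's actual code):

```python
import torch

bsz, seq, head_dim = 1, 8, 64
num_q_heads, num_kv_heads = 32, 8      # 4 query heads share each KV head
group = num_q_heads // num_kv_heads

q = torch.randn(bsz, num_q_heads, seq, head_dim)
k = torch.randn(bsz, num_kv_heads, seq, head_dim)
v = torch.randn(bsz, num_kv_heads, seq, head_dim)

# Expand each KV head across its query-head group before attention.
k = k.repeat_interleave(group, dim=1)  # -> [bsz, num_q_heads, seq, head_dim]
v = v.repeat_interleave(group, dim=1)

attn = torch.softmax(q @ k.transpose(-2, -1) / head_dim**0.5, dim=-1)
out = attn @ v                         # [bsz, num_q_heads, seq, head_dim]
```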
-
I am using the provided SAS token to download this YAML file:
> coco_flickr30k_googlecc_gqa_sbu_oi_x152c4big2exp168.yaml
`./azcopy copy "https://biglmdiag.blob.core.windows.net/vinvl/pretrain_corpu…
-
May I ask whether this tool is currently unable to prune GQA models, such as Llama2-70B or Llama3?