-
I am trying to execute the following script:
from llama_cpp import Llama
llm = Llama(model_path="~/llama-2-7b.ggmlv3.q8_0.bin", n_gqa=8)
output = llm("Q: Name the planets in the solar sy…
-
Background: inference for large language models is expensive, largely because of the memory-bandwidth cost of loading keys and values. Grouped-Query Attention (GQA) is an interpolation between multi-query and multi-head attention: it achieves quality close to multi-head attention at a speed comparable to multi-query attention.
Paper link: https://arxiv.org/pdf/2305.13245.pdf
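The interpolation described above can be sketched in plain NumPy: with `n_heads` query heads but only `n_kv_heads` key/value heads, each KV head is shared by a group of `n_heads / n_kv_heads` query heads (setting `n_kv_heads = n_heads` recovers multi-head attention; `n_kv_heads = 1` recovers multi-query attention). This is an illustrative sketch, not the paper's or llama.cpp's implementation; all names here are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v, n_heads, n_kv_heads):
    """Sketch of GQA for one sequence (no batching, no causal mask).

    q: (T, n_heads * d)      -- one query projection per head
    k, v: (T, n_kv_heads * d) -- fewer key/value projections
    """
    T = q.shape[0]
    d = q.shape[1] // n_heads
    group = n_heads // n_kv_heads  # query heads sharing each KV head
    q = q.reshape(T, n_heads, d)
    k = k.reshape(T, n_kv_heads, d)
    v = v.reshape(T, n_kv_heads, d)
    # Broadcast each KV head across its group of query heads.
    k = np.repeat(k, group, axis=1)
    v = np.repeat(v, group, axis=1)
    out = np.empty_like(q)
    for h in range(n_heads):
        scores = (q[:, h] @ k[:, h].T) / np.sqrt(d)
        out[:, h] = softmax(scores) @ v[:, h]
    return out.reshape(T, n_heads * d)
```

The memory saving comes from the KV cache: only `n_kv_heads * d` values per token need to be stored and streamed, rather than `n_heads * d`.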
-
# URL
- https://arxiv.org/abs/2305.13245
# Affiliations
- Joshua Ainslie, N/A
- James Lee-Thorp, N/A
- Michiel de Jong, N/A
- Yury Zemlyanskiy, N/A
- Federico Lebrón, N/A
- Sumit Sanghai, …
-
Can trained models be provided, especially for the GQA dataset?
-
### Describe the bug
Model URL:
https://huggingface.co/bartowski/Hubble-4B-v1-GGUF/discussions/1
llama_model_loader: - kv 26: tokenizer.ggml.merges arr[str,280147] = ["Ġ Ġ"…
-
Why is this project code reporting errors in the CLEVR dataset?
The question is:
Traceback (most recent call last):
File "/mnt/public/home/s-xuk/mcan-gqa/run.py", line 160, in
execution.run…
-
I want to convert this small 1.1B llama2 architecture model [PY007/TinyLlama-1.1B-intermediate-step-240k-503b](https://huggingface.co/PY007/TinyLlama-1.1B-intermediate-step-240k-503b) to llama2.c vers…
-
Hi,
Is it possible to provide the details on how the first version was evaluated on benchmarks such as GQA or AOK-VQA in Table 6 of the paper?
Thanks
-
I have tried LoRA (perhaps my LoRA setup was not great) and tried freezing part of the weights. Does the author have any suggestions?
-
Hello, could you please tell me the training time on HICO and GQA, and which GPU was used? Thanks!