-
### Question
I modified eval.py from https://gist.github.com/haotian-liu/db6eddc2a984b4cbcc8a7f26fd523187, but I get this error.
How can I solve it? Thank you.
-
Take a look at the commit history related to implementing `n_gqa` and undo/remove everything related to it.
Checklist
- [X] `api/src/serge/utils/migrate.py` ✅ Commit [`ee41b29`](https:/…
-
Hello,
Have you checked what happens when `n_heads != n_kv_heads`? How does this affect the RoPE rotation and the MHA, which now becomes GQA?
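For context, a minimal NumPy sketch of how GQA behaves when `n_heads != n_kv_heads` (illustrative only, not taken from the repo in question): each group of query heads shares one KV head, typically by repeating the KV heads before the attention product, while RoPE is applied per head beforehand and is unaffected by the repeat.

```
import numpy as np

def repeat_kv(x, n_rep):
    # (n_kv_heads, seq, head_dim) -> (n_kv_heads * n_rep, seq, head_dim)
    return np.repeat(x, n_rep, axis=0)

n_heads, n_kv_heads, seq, head_dim = 8, 2, 4, 16
q = np.random.randn(n_heads, seq, head_dim)
k = np.random.randn(n_kv_heads, seq, head_dim)
v = np.random.randn(n_kv_heads, seq, head_dim)

# each group of n_heads // n_kv_heads query heads shares one KV head
k_rep = repeat_kv(k, n_heads // n_kv_heads)
v_rep = repeat_kv(v, n_heads // n_kv_heads)

scores = q @ k_rep.transpose(0, 2, 1) / np.sqrt(head_dim)
weights = np.exp(scores - scores.max(-1, keepdims=True))
weights /= weights.sum(-1, keepdims=True)
out = weights @ v_rep  # same output shape as plain MHA
```

With `n_kv_heads == n_heads` this degenerates to standard MHA, which is why the case `n_heads != n_kv_heads` is the one worth testing.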
-
Hi Rajat, Suprosanna and Volker,
I'm trying to use your code for RTN scene graph generation on GQA. Specifically, I'm looking at the gqa_1.4 branch, but I didn't see a `requirement.txt` there.
A…
-
When I try to run inference using **APE-Ti**, I get the following error:
```
Traceback (most recent call last):
File "/home/jupyter/TIL-2024/vlm/train/APE/demo/demo_lazy.py", line 134, in
d…
-
Thanks for the repo
On trying to evaluate the model using `python main.py --expName "gqaExperiment" --finalTest --testedNum 1000 --netLength 4 -r --submission --getPreds @configs/gqa/gqa.txt`, it a…
-
```
[rank0]: Traceback (most recent call last):
[rank0]: File "Pai-Megatron-Patch-0925/toolkits/model_checkpoints_convertor/qwen/hf2mcore_qwen2_dense_and_moe_gqa.py", line 924, in
[rank0]: m…
-
### Describe the issue
I'm trying to quantize a model to int4, but this file only provides weight-only quantization. Can I quantize both the weights and the activations to int4?
https://github.com/micros…
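For reference, a minimal sketch of what weight-and-activation int4 quantization involves, independent of the file linked above (illustrative names and per-tensor scales; real kernels usually use per-channel or per-group scales):

```
import numpy as np

def quantize_int4(x):
    # symmetric per-tensor quantization to the int4 range [-8, 7]
    scale = np.abs(x).max() / 7.0
    q = np.clip(np.round(x / scale), -8, 7).astype(np.int8)
    return q, scale

w = np.random.randn(64, 64).astype(np.float32)
a = np.random.randn(8, 64).astype(np.float32)

qw, sw = quantize_int4(w)   # weight quantization (weight-only stops here)
qa, sa = quantize_int4(a)   # activation quantization (the part being asked about)

# integer matmul, rescaled back to float with the two scales
y = (qa.astype(np.int32) @ qw.T.astype(np.int32)) * (sa * sw)
```

The difference from weight-only quantization is that the activations must be quantized at runtime, so the matmul itself can run on integers rather than dequantizing the weights back to float.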
-
I am trying to execute the following script:
```
from llama_cpp import Llama
llm = Llama(model_path="~/llama-2-7b.ggmlv3.q8_0.bin", n_gqa=8)
output = llm("Q: Name the planets in the solar sy…
```
-
No GQA implementation is found, so the model cannot scale to 70B for composerLLAMA.
Maybe we need to design GQA and introduce `head_z` for `wq` and `head_z_kv` for `wk` and `wv`?
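A sketch of what that suggestion could look like (illustrative only, assuming a standard GQA weight layout where `wq` stacks `n_heads` head blocks and `wk`/`wv` stack `n_kv_heads` blocks): `head_z` masks the query heads, and a separate `head_z_kv` masks the shared KV heads, so pruning a KV head consistently removes its whole group.

```
import numpy as np

n_heads, n_kv_heads, head_dim, d_model = 8, 2, 16, 128

# hypothetical pruning masks: one entry per query head, one per KV head
head_z = np.ones(n_heads)        # masks wq
head_z_kv = np.ones(n_kv_heads)  # masks wk and wv jointly

head_z[[5, 7]] = 0.0  # prune two query heads
head_z_kv[1] = 0.0    # prune one KV head (shared by a whole query group)

wq = np.random.randn(n_heads * head_dim, d_model)
wk = np.random.randn(n_kv_heads * head_dim, d_model)
wv = np.random.randn(n_kv_heads * head_dim, d_model)

# apply the masks by zeroing the rows belonging to pruned heads
wq_masked = wq * np.repeat(head_z, head_dim)[:, None]
wk_masked = wk * np.repeat(head_z_kv, head_dim)[:, None]
wv_masked = wv * np.repeat(head_z_kv, head_dim)[:, None]
```

The key design point is that `wk` and `wv` share one mask: with GQA, zeroing a KV head only makes sense if both its key and value projections go together.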