mha Search Results - Githubissues

Artur-Galstyan/kira #10

Using equinox versions of MHA/RoPE

Hi - nice work on this. Now that your MHA/RoPE implementations are in equinox, do you plan to make a kira version that uses those directly? For one I thing I notice the API is a little different here …

davidaknowles updated 1 month ago

sbonaretti/pyKNEEr #18

morphology_functions.py question?

HI, Do you know why I will receive the below error message with 01_cubeQuant_01_prep_fc.mha, there is no problem with 01_DESS_01_prep_fc.mha. With 01_cubeQuant_01_prep_fc.mha (but not 01_DESS_…

ylim99 updated 2 weeks ago

deepseek-ai/DeepSeek-V2 #19

MLA vs MHA

Hello, great work. I want to know why the performance of MLA is better than that of MHA. I think MLA is a approximate low-rank decomposition for MHA.

jiangix-paper updated 4 months ago

4pygmalion/cosas #117

submission

#### History - Task2 ``` res = { "error": "", "means": { "dice": 0.6574674234095118, "jaccard": 0.5470246182762484, "accuracy": 0.7255573866247164 }, "score": 0.602246020…

4pygmalion updated 1 month ago

microsoft/onnxruntime-genai #880

[feature request] builder to expose {GQA, MHA} selection as …

Currently these are inferred from the combination of other configurations such as device and dtype. It is more flexible for downstream users if this can be selected by choice.

BowenBao updated 2 weeks ago

NVIDIA/TensorRT #4167

No MHA (muti head attention) kernal is called in Tensorrt 10…

# Description Use exact ONNX file `attention_ln_opset13.onnx` from https://github.com/NVIDIA/TensorRT/issues/3575#issuecomment-1874776406 Attention is like ![Image](https://github.com/user-attachment…

steventu27 updated 11 hours ago

PlusToolkit/PlusMatlabUtils #6

Updated some incompatibilities

Handled the case where the filename is not passed into mha_read_volume, which caused an error in calling mha_read_header with an undeclared variable (filename) fopen doesn't seem to need the filename…

rmustakos updated 1 month ago

deepseek-ai/DeepSeek-V2 #23

Failure to reproduce MLA > MHA

I tried out MLA and it was a good amount worse than MHA and wanted to try to find out why. Firstly, I am using a hybrid model therefore I am not using any Rope in either MLA or MHA, and therefore use …

faresobeid updated 4 months ago

is00hcw/mysql-master-ha #117

MHA Scalability

``` Hello, We are planning to run the MHA at our company, however we are wondering with one MHA Manager how many nodes/apps we can monitor? 100s, 1000s etc. We have quite a few nodes and wanted to …

GoogleCodeExporter updated 8 years ago

zhouyh139/mysql-master-ha #117

MHA Scalability

GoogleCodeExporter updated 9 years ago

1000+ results for mha

1000+ results
for mha