-
FP8 is very useful for LLM training and inference. Does FlashAttention support FP8?
Thank you~
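For context, a minimal sketch of what an FP8 path looks like from the PyTorch side, assuming PyTorch >= 2.1 (for `torch.float8_e4m3fn`) and a CUDA GPU. It assumes the flash/SDPA kernels only accept fp16/bf16, so the FP8 tensors are upcast right before the call; the shapes and the upcast-before-attention pattern are illustrative assumptions, not documented library behavior.
```python
import torch
import torch.nn.functional as F

# Illustrative shapes: (batch, heads, seq_len, head_dim)
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# Assumption: activations are *stored* in FP8 (torch.float8_e4m3fn, PyTorch >= 2.1)
# but upcast to fp16 before attention, since most flash-attention kernels
# only accept fp16/bf16 inputs.
q8, k8, v8 = (t.to(torch.float8_e4m3fn) for t in (q, k, v))

out = F.scaled_dot_product_attention(
    q8.to(torch.float16),
    k8.to(torch.float16),
    v8.to(torch.float16),
    is_causal=True,
)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```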
-
There's an error message at the KSamplerTheMisto node, and the workflow can't complete. My GPU is a 2080ti.
"NotImplementedError: No operator found for `memory_efficient_attention_forward` with inp…
-
Yesterday I encountered this message:
I did what I normally do, i.e. searched for the Zotero Short Title in Zotero standalone, merged the two items, and then synchronised, but it did not make the error…
-
Great interface, I really like it.
But every few seconds I get a 10-15 frame drop that causes tearing on the screen.
Since I started using it, WoW randomly freezes in instances.
Disabling DarkUI …
-
The crashes are frequent and intermittent; sometimes only Steam is open, no game at all, and it still crashes.
Terminal output:
```
[gamescope] [Info] console: gamescope version undefined
ATTENTION:…
```
-
Opening this issue so we don't forget: Once #1545 is merged, let's also add sliding window attention to Mistral 0.1
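For reference, a minimal sketch of the sliding-window causal mask that Mistral-style attention uses, independent of the actual PR; the function name and toy sizes are illustrative, not the repo's implementation.
```python
import torch

def sliding_window_causal_mask(seq_len: int, window: int) -> torch.Tensor:
    """Boolean mask where True = attend. Each token sees itself and at most
    `window - 1` previous tokens (causal + sliding window)."""
    i = torch.arange(seq_len).unsqueeze(1)   # query positions
    j = torch.arange(seq_len).unsqueeze(0)   # key positions
    return (j <= i) & ((i - j) < window)

mask = sliding_window_causal_mask(seq_len=6, window=3)
print(mask.int())  # row t attends to keys in [t - 2, t]; earlier keys are masked
```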
-
See this excerpt from https://dev.azure.com/Lightning-AI/lightning/_build/results?buildId=215225&view=logs&j=5b0799f7-725e-5b16-9b83-c0a5a25d03f0&t=97651ec4-0b0f-5455-bbb5-3c30427a0a7e
```
FAILED …
```
-
I got the following error message when running `make -j10`:
```
CMake Error at ctranslate2_generated_flash_fwd_split_hdim96_fp16_sm80.cu.o.Release.cmake:280 (message):
Error generating file
/work…
```
-
Thank you very much for your excellent work. I wonder whether rpb should be added to the attention weights, which have shape [K, K]. However, the shape of rpb in 2D NATTEN is [2K-1, 2K-1]. It is hard for me to find the …
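To make the question concrete, here is my current reading of the indexing as a sketch (an assumption about how the [2K-1, 2K-1] table maps onto [K, K] weights, not NATTEN's actual code): the table is indexed by the relative offset key minus query, which spans 2K-1 values per axis because the K-sized window shifts near the borders instead of staying centered on the query.
```python
import torch

def gather_rpb_for_query(rpb, qi, qj, H, W, K):
    """Select the [K, K] slice of a [2K-1, 2K-1] relative-positional-bias
    table for one query at (qi, qj) on an H x W feature map (my assumption)."""
    # Top-left corner of the query's K x K neighborhood, shifted to stay in-bounds.
    start_i = min(max(qi - K // 2, 0), H - K)
    start_j = min(max(qj - K // 2, 0), W - K)
    keys_i = torch.arange(start_i, start_i + K)
    keys_j = torch.arange(start_j, start_j + K)
    # Relative offsets (key - query), mapped into [0, 2K-2] for the lookup.
    di = keys_i - qi + (K - 1)
    dj = keys_j - qj + (K - 1)
    return rpb[di][:, dj]   # [K, K] bias for this query's attention weights

K, H, W = 5, 16, 16
rpb = torch.randn(2 * K - 1, 2 * K - 1)
bias = gather_rpb_for_query(rpb, qi=0, qj=7, H=H, W=W, K=K)  # query on the top border
print(bias.shape)  # torch.Size([5, 5])
```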
-
lucidrains & zhuzilin have been hard at work over the last few days and have completed the following two ring-attention implementations:
- [lucidrains/ring-attention-pytorch](https://github.com/lucidrains/ring-a…
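For anyone skimming, a single-process sketch of the idea behind both repos: query blocks stay put, K/V blocks circulate around the ring, and the softmax is accumulated online with the running-max trick. This is illustrative only and not taken from either implementation.
```python
import torch

def ring_attention_sim(q, k, v, n_blocks):
    """Simulate ring attention on one process. Shapes: (seq_len, head_dim).
    Each query block sees the K/V blocks one at a time, as if they were
    arriving from ring neighbors, and accumulates its softmax online."""
    scale = q.shape[-1] ** -0.5
    k_blocks, v_blocks = k.chunk(n_blocks), v.chunk(n_blocks)
    outputs = []
    for qb in q.chunk(n_blocks):
        num = torch.zeros_like(qb)                       # running numerator
        den = torch.zeros(qb.shape[0], 1)                # running denominator
        m = torch.full((qb.shape[0], 1), float("-inf"))  # running row max
        for kb, vb in zip(k_blocks, v_blocks):           # one "ring step" per block
            s = qb @ kb.T * scale
            m_new = torch.maximum(m, s.max(dim=-1, keepdim=True).values)
            corr = torch.exp(m - m_new)
            p = torch.exp(s - m_new)
            num = num * corr + p @ vb
            den = den * corr + p.sum(dim=-1, keepdim=True)
            m = m_new
        outputs.append(num / den)
    return torch.cat(outputs)

q, k, v = (torch.randn(128, 64) for _ in range(3))
ref = torch.softmax(q @ k.T * 64 ** -0.5, dim=-1) @ v
print(torch.allclose(ring_attention_sim(q, k, v, 4), ref, atol=1e-5))  # True
```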