-
Hello authors, I have a question about Equation (4) in the paper:
$$\mathrm{S}_{\mathrm{sim}}(p)=\underset{i}{\mathrm{softmax}}\big(\{\langle F_{ref}^{1/8}(p),\,F_{src}^{1/8}(p_i)\rangle\}_{p…
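To make my reading of the equation concrete, here is a tiny PyTorch sketch: take the 1/8-resolution reference feature at pixel p, dot it with the source features at each candidate location p_i, and softmax over the candidate index i. The feature maps, pixel, and candidate set below are made up for illustration, not from the paper's code.

```python
# Minimal sketch of my reading of Eq. (4); all names/values here are placeholders.
import torch

C, H, W = 64, 32, 40          # channels and 1/8-resolution spatial size
F_ref = torch.randn(C, H, W)  # reference-view feature map at 1/8 resolution
F_src = torch.randn(C, H, W)  # source-view feature map at 1/8 resolution

p = (10, 17)                                            # query pixel in the reference view
candidates = [(10, 15), (10, 16), (10, 17), (10, 18)]   # hypothetical candidate pixels p_i

f_ref = F_ref[:, p[0], p[1]]                                    # (C,)
f_src = torch.stack([F_src[:, y, x] for (y, x) in candidates])  # (N, C)

logits = f_src @ f_ref                 # <F_ref(p), F_src(p_i)> for each i
S_sim = torch.softmax(logits, dim=0)   # softmax over the candidate index i
print(S_sim)
```

Is this the intended meaning, or is the softmax taken over a different set of positions?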
-
Thank you for the great work on FA3. I am wondering whether FA3 will soon support sliding window attention, as FA2 does?
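For reference, this is the FA2 usage I have in mind: since flash-attn 2.3, `flash_attn_func` accepts a `window_size=(left, right)` argument. The shapes and the 1024-token window below are just illustrative.

```python
# Sketch of FA2's sliding-window interface; shapes/window are illustrative only.
import torch
from flash_attn import flash_attn_func

B, S, H, D = 2, 4096, 16, 64
q = torch.randn(B, S, H, D, dtype=torch.float16, device="cuda")
k = torch.randn(B, S, H, D, dtype=torch.float16, device="cuda")
v = torch.randn(B, S, H, D, dtype=torch.float16, device="cuda")

# Causal attention restricted to the previous 1024 tokens
out = flash_attn_func(q, k, v, causal=True, window_size=(1024, 0))
```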
-
### System Info
Hi!
I'm running a speculative-execution TRT-LLM engine with a generation length of 4 or 5, and I noticed that FP8 KV-cache attention runs slower than FP16 KV-cache attention. Would be grea…
-
Hi,
I'm looking through the Differential Transformer paper and code,
and I found that the GitHub version is based on flash attention and rotary embeddings.
I wonder whether there is any plan to upload a simple example …
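For context, the kind of simple example I am hoping for is something like this naive single-head sketch of the differential attention formula, without flash attention or rotary embeddings. The scalar `lam` here is just a placeholder for the learned lambda reparameterization in the paper, and the shapes are made up.

```python
# Naive sketch of differential attention as I understand the paper:
# out = (softmax(Q1 K1^T / sqrt(d)) - lambda * softmax(Q2 K2^T / sqrt(d))) @ V
import torch
import torch.nn.functional as F

B, S, d = 2, 128, 64          # batch, sequence length, per-projection head dim
lam = 0.8                     # placeholder for the learned lambda

q1, q2 = torch.randn(B, S, d), torch.randn(B, S, d)
k1, k2 = torch.randn(B, S, d), torch.randn(B, S, d)
v = torch.randn(B, S, 2 * d)  # the paper uses a 2d-dimensional value per head pair

a1 = F.softmax(q1 @ k1.transpose(-2, -1) / d ** 0.5, dim=-1)
a2 = F.softmax(q2 @ k2.transpose(-2, -1) / d ** 0.5, dim=-1)
out = (a1 - lam * a2) @ v     # (B, S, 2d)
```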
-
I got a small freelance deal from [Kdot UK (aka Waterlily Labs)](https://www.waterlilylabs.com/) early this year.
The company's whole R&D team is located in Sri Lanka. The company is so poor…
-
I saw that flash attention was recently merged.
This approximate attention would be cool to have as well, for training on very large sequence lengths: https://github.com/HazyResearch/flash-attention/blob/m…
-
Hi,
thanks a lot for providing the SAMAPI extension for QuPath!
While the extension does work, segmentation takes a long time.
I have noticed the following user warning:
`UserWarning: Flash …
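In case it helps with debugging, here is a small snippet (separate from SAMAPI) to check whether a GPU and a flash-attention kernel are actually available on the machine; I assume the warning means the model falls back to a slower attention path, or possibly to CPU.

```python
# Quick environment check: GPU availability and flash-attention kernels.
import importlib.util
import torch

print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
    print("PyTorch flash SDP enabled:", torch.backends.cuda.flash_sdp_enabled())
print("flash_attn installed:", importlib.util.find_spec("flash_attn") is not None)
```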
-
Config:
Windows 10 with RTX4090
All requirements incl. flash-attn build - done!
Server:
```
(venv) D:\PythonProjects\hertz-dev>python inference_server.py
Using device: cuda
Loaded tokeniz…
```
-
Hi, I installed Flash-Attention-3 from source and ran test_flash_attn.py. I found that 5946 unit tests pass and 6 fail. Could you please help me resolve these failures?
My env:
+ source code commit id:…
-
Hi @zhuzilin, following up from https://github.com/zhuzilin/ring-flash-attention/issues/15
I just wanted to verify the causal case. I simply use a loop because I don't have multiple GPUs, but it should be wor…
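For clarity, here is a rough sketch of the kind of single-GPU loop I mean (not the actual test code, and not the ring-flash-attention API): visit the K/V sequence one chunk at a time, merge the partial outputs with a running log-sum-exp as a ring pass would, and compare against full causal attention.

```python
# Single-GPU simulation of a causal "ring" pass over K/V chunks (float32, naive attention).
import torch

torch.manual_seed(0)
B, H, S, D, CHUNK = 2, 4, 256, 64, 64
q = torch.randn(B, H, S, D)
k = torch.randn(B, H, S, D)
v = torch.randn(B, H, S, D)
scale = D ** -0.5

# Reference: full causal attention computed in one shot
causal = torch.tril(torch.ones(S, S, dtype=torch.bool))
ref = ((q @ k.transpose(-2, -1)) * scale).masked_fill(~causal, float("-inf")).softmax(-1) @ v

# "Ring" simulation: visit K/V chunk by chunk, keep running max / sum / output
out = torch.zeros_like(q)
row_max = torch.full((B, H, S, 1), float("-inf"))
row_sum = torch.zeros(B, H, S, 1)
qry_pos = torch.arange(S).unsqueeze(-1)
for start in range(0, S, CHUNK):
    k_c, v_c = k[:, :, start:start + CHUNK], v[:, :, start:start + CHUNK]
    s_c = (q @ k_c.transpose(-2, -1)) * scale
    key_pos = torch.arange(start, start + CHUNK)
    s_c = s_c.masked_fill(key_pos > qry_pos, float("-inf"))  # causal mask for this chunk
    new_max = torch.maximum(row_max, s_c.amax(-1, keepdim=True))
    correction = (row_max - new_max).exp()                    # rescale the old running state
    p = (s_c - new_max).exp()
    row_sum = row_sum * correction + p.sum(-1, keepdim=True)
    out = out * correction + p @ v_c
    row_max = new_max

out = out / row_sum
print((out - ref).abs().max())  # should be tiny (~1e-6 in float32)
```

Does this match what the multi-GPU causal path is supposed to compute?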