-
**Describe the bug**
While trying the DPO trainer example I hit a bug related to batch size and sharding. Maybe the shard axes are not set properly, or it could be a JAX error. The system used is a v3-32 (4 hosts).
…
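For reference, a minimal sketch (not the DPO trainer's actual config) of how shard axes are typically declared with `jax.sharding`; the axis names `dp`/`mp` and the 4x8 device layout are assumptions for a v3-32 slice, and a mismatch between such names and the trainer's partition rules, or a batch that does not divide the data-parallel axis, is the usual cause of this kind of error:

```python
import jax
import numpy as np
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# Hypothetical layout for a v3-32 slice: 4 hosts x 8 local devices = 32 devices.
# Axis names "dp"/"mp" are placeholders; they must match whatever names the
# trainer's partition rules reference, otherwise arrays end up replicated or
# resharding fails with a shape/divisibility error.
devices = np.array(jax.devices()).reshape(4, 8)
mesh = Mesh(devices, axis_names=("dp", "mp"))

# The global batch has to divide evenly over the data-parallel axis.
global_batch = 32
assert global_batch % mesh.shape["dp"] == 0, "batch not divisible by dp axis"

# Batch dimension sharded over "dp", feature dimension replicated.
batch_sharding = NamedSharding(mesh, P("dp", None))
print(dict(mesh.shape), batch_sharding)
```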
-
### Report of performance regression
I found that the attention (flashattn.py) computation time increased by 1.7x after upgrading vLLM from 0.6.0 to 0.6.3.
| | v0.6.0 | v0.6.3 |
| :----: | :----: | :----: |
…
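This is not vLLM's internal flashattn.py path, but a standalone sketch for timing the PyTorch SDPA kernel in both environments, which can help tell whether the regression is in the attention kernel itself or in the surrounding code; the shapes and dtype below are illustrative only:

```python
import torch
import torch.nn.functional as F

# Illustrative shapes; adjust to match the model's actual head layout.
B, H, S, D = 8, 32, 1024, 128
q = torch.randn(B, H, S, D, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

def bench(fn, iters=50):
    # Warm up, then time with CUDA events so only kernel time is measured.
    for _ in range(5):
        fn()
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn()
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

ms = bench(lambda: F.scaled_dot_product_attention(q, k, v, is_causal=True))
print(f"SDPA: {ms:.3f} ms/iter")
```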
-
### Printer model
MK4
### Firmware version
6.1.3
### Upgrades and modifications
_No response_
### Printing from...
PrusaConnect
### Describe the bug
The API's /printer, /job, /v1/job all repo…
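A small sketch for polling the endpoints named above and comparing what each one reports; the hostname and API key are placeholders, the paths assume PrusaLink's usual `/api` prefix and `X-Api-Key` header, and all of this should be checked against the printer's actual firmware:

```python
import json
import requests  # third-party: pip install requests

PRINTER = "http://prusa-mk4.local"        # placeholder hostname
HEADERS = {"X-Api-Key": "YOUR_API_KEY"}   # placeholder key; digest auth is another option

# Endpoints named in the report; paths may differ depending on firmware version.
for path in ("/api/printer", "/api/job", "/api/v1/job"):
    resp = requests.get(PRINTER + path, headers=HEADERS, timeout=5)
    print(path, resp.status_code)
    print(json.dumps(resp.json(), indent=2)[:400])  # truncated for readability
```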
-
As title
-
### System Info
On main
### Who can help?
@zucchini-nlp @gante
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially su…
-
This is printed when I call `functional.scaled_dot_product_attention`:
> [W914 13:25:36.000000000 sdp_utils.cpp:555] Warning: 1Torch was not compiled with flash attention. (function operator ())
…
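The warning means the flash backend is not available in that build, so SDPA silently falls back to another kernel. A sketch (using the stock PyTorch backend flags, not tied to any particular model; the tensor shapes and CUDA device are illustrative) for checking which backends the current build actually supports, and for forcing the flash backend so the failure surfaces as an error instead of a warning:

```python
import torch
import torch.nn.functional as F

print(torch.__version__, torch.version.cuda)
print("flash enabled:        ", torch.backends.cuda.flash_sdp_enabled())
print("mem-efficient enabled:", torch.backends.cuda.mem_efficient_sdp_enabled())
print("math enabled:         ", torch.backends.cuda.math_sdp_enabled())

q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
# Restrict SDPA to the flash backend; this raises instead of warning if the
# build or the input shapes/dtypes do not support it.
with torch.backends.cuda.sdp_kernel(enable_flash=True, enable_math=False,
                                    enable_mem_efficient=False):
    out = F.scaled_dot_product_attention(q, q, q)
```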
-
I used to run this pipeline fine, but after a few recent updates, coming back to this exact workflow I'm hitting new issues. Can anyone help? Thanks.
# ComfyUI Error Report
## Err…
-
In the linear_focus_attention part, why isn't an operation like phi_qs = (F.relu(qs) + 1e-6) / (self.norm_scale.abs() + 1e-6) also applied to the v values? Equation (15) in the paper applies the Phi function to Q_s, K_s, and V_s.
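For reference, a minimal sketch of linear attention with the feature map from the question applied to Q and K only; the shapes and the `norm_scale` parameter are assumptions based on the snippet, not the repository's actual code. It does not explain the authors' choice, but it shows where V sits in the computation: it is only aggregated by the kernel weights, which is why many linear-attention implementations leave it un-mapped.

```python
import torch
import torch.nn.functional as F

def phi(x, norm_scale):
    # Non-negative kernel feature map, same form as in the question.
    return (F.relu(x) + 1e-6) / (norm_scale.abs() + 1e-6)

# Illustrative shapes: (batch, heads, tokens, dim)
B, H, N, D = 2, 4, 16, 32
qs, ks, vs = (torch.randn(B, H, N, D) for _ in range(3))
norm_scale = torch.nn.Parameter(torch.ones(D))

phi_q, phi_k = phi(qs, norm_scale), phi(ks, norm_scale)

# Linear attention: build the K-V summary first, then project it with phi(Q).
# V enters only as the values being aggregated.
kv = torch.einsum("bhnd,bhne->bhde", phi_k, vs)
z = 1.0 / (torch.einsum("bhnd,bhd->bhn", phi_q, phi_k.sum(dim=2)) + 1e-6)
out = torch.einsum("bhnd,bhde,bhn->bhne", phi_q, kv, z)
print(out.shape)  # (B, H, N, D)
```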
-
Hi, thanks for your contribution to this project.
My question is: how can I obtain the attention map for the predicted image?
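Without knowing this repository's model code, one generic way to pull attention maps out of a PyTorch model is a forward hook on the attention module; everything below (the module, names, and shapes) is a placeholder to adapt to the actual model:

```python
import torch
import torch.nn as nn

attn_maps = {}

def save_attention(name):
    # Forward hook that stores the module's returned attention weights.
    def hook(module, inputs, output):
        # nn.MultiheadAttention returns (attn_output, attn_weights) when
        # need_weights=True; other implementations expose weights differently.
        if isinstance(output, tuple) and len(output) > 1 and output[1] is not None:
            attn_maps[name] = output[1].detach().cpu()
    return hook

# Placeholder module; in practice, pick the real attention modules from
# model.named_modules() and register a hook on each one of interest.
model = nn.MultiheadAttention(embed_dim=64, num_heads=4, batch_first=True)
model.register_forward_hook(save_attention("attn"))

x = torch.randn(1, 16, 64)
model(x, x, x, need_weights=True)
print(attn_maps["attn"].shape)  # (batch, tgt_len, src_len), averaged over heads
```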
-
When running with `attention = True`, the last batch has wrong latitude values.
Min and max latitude values for that particular batch:
```
batch 10 = 135.0 - 146.25
batch 11 = 148.5…