-
I tested the LyCORIS/LoCon preset for SDXL and it works fine, but if the "DoRA Weight Decompose" box is checked (with or without extra algorithms), training will not continue. I tried the default preset with 'G…
-
The `sdpa_ex` implementation of `torch.nn.functional.scaled_dot_product_attention` reports every output tensor proxy in the trace as being on `cuda`, but at runtime some outputs are on `cpu`.
Repro
```python
i…
```
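The original repro is truncated above. As a stand-in, a minimal device check under the same conditions (plain PyTorch, a CUDA device assumed; this does not use `sdpa_ex` itself) looks like this:

```python
import torch
import torch.nn.functional as F

# Minimal stand-in for the truncated repro: run SDPA on CUDA inputs and
# verify where the output actually lives at runtime.
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

out = F.scaled_dot_product_attention(q, k, v)
print(out.device)  # the report says the trace claims cuda, yet some outputs land on cpu
```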
-
### Is there an existing issue for this problem?
- [X] I have searched the existing issues
### Operating system
Linux
### GPU vendor
Nvidia (CUDA)
### GPU model
_No response_
#…
-
### 🐛 Describe the bug
In the blog post https://pytorch.org/blog/flexattention/, it says `For example, for a sequence length of 1 million, the BlockMask would only use 60MB of additional memory`.
Howev…
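One way to check the claim empirically is to measure device memory around mask construction. A sketch, assuming PyTorch ≥ 2.5 with CUDA; a shorter sequence length is used here because eager `create_block_mask` evaluates a dense `Q_LEN × KV_LEN` grid before compressing it:

```python
import torch
from torch.nn.attention.flex_attention import create_block_mask

def causal(b, h, q_idx, kv_idx):
    return q_idx >= kv_idx

seq_len = 65_536  # kept modest: eager construction materializes the dense grid first

before = torch.cuda.memory_allocated()
block_mask = create_block_mask(
    causal, B=None, H=None, Q_LEN=seq_len, KV_LEN=seq_len, device="cuda"
)
after = torch.cuda.memory_allocated()
print(f"BlockMask retains {(after - before) / 2**20:.1f} MiB on device")
```

Note that `memory_allocated()` only reports what the finished BlockMask retains; the transient dense grid shows up in `torch.cuda.max_memory_allocated()` instead.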
-
I want to fine-tune a model using unsloth. Everything works fine on Colab, but on my system I got the following:
```
{
"name": "NotImplementedError",
"message": "No operator found for `memory_efficie…
```
-
```python
def memory_efficient_attention(
query: torch.Tensor,
key: torch.Tensor,
value: torch.Tensor,
attn_bias: Optional[Union[torch.Tensor, AttentionBias]] = None,
    p: floa…
```
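A usage sketch for this signature (shapes follow the documented `[batch, seq_len, num_heads, head_dim]` convention; the values themselves are illustrative):

```python
import torch
import xformers.ops as xops

q = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(2, 1024, 8, 64, device="cuda", dtype=torch.float16)

# attn_bias=None gives full attention; p is the dropout probability applied
# to the attention weights during training.
out = xops.memory_efficient_attention(q, k, v, p=0.1)
print(out.shape)  # torch.Size([2, 1024, 8, 64])
```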
-
I got an error when starting gnfactor training.
My environment setup is below:
GPU: NVIDIA L40S
CUDA: 11.7 (in a Docker environment)
Ubuntu: 22.04
It seems that xformers is incompatible with torc…
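Since xformers wheels are built against specific torch/CUDA versions, a quick compatibility check is to print all three (a sketch; `python -m xformers.info` gives a fuller report in recent releases):

```python
import torch
print("torch:", torch.__version__, "| built for CUDA:", torch.version.cuda)

import xformers  # fails loudly here if the binary does not match the torch build
print("xformers:", xformers.__version__)
```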
-
I have installed xformers==0.0.13, but when executing the build script I still get the error "RuntimeError: No such operator xformers::efficient_attention_forward_generic - did you forget to build xfo…
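That error comes from PyTorch's operator registry, so the lookup can be checked directly. A sketch, assuming `import xformers` itself succeeds; the operator name is taken verbatim from the error message:

```python
import torch
import xformers  # importing xformers registers its compiled ops, if they were built

# Raises RuntimeError("No such operator xformers::...") when the C++/CUDA
# extension was never compiled, e.g. a source install without a CUDA toolchain.
print(torch.ops.xformers.efficient_attention_forward_generic)
```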
-
Hi,
I have the issue that my sequence lengths vary strongly, sometimes with outliers an order of magnitude longer than the average. In default PyTorch one can only pass a pa…
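xformers handles this case without padding: concatenate all sequences along the token axis and describe their boundaries with a block-diagonal attention bias. A sketch, assuming a CUDA GPU; the lengths are illustrative:

```python
import torch
from xformers.ops import fmha, memory_efficient_attention

seqlens = [13, 257, 4096]  # strongly varying lengths, no padding required
total = sum(seqlens)

# All sequences packed into one "batch" of total tokens: [1, total, heads, head_dim].
q = torch.randn(1, total, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, total, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, total, 8, 64, device="cuda", dtype=torch.float16)

# The bias restricts attention to tokens within the same sequence.
attn_bias = fmha.BlockDiagonalMask.from_seqlens(seqlens)
out = memory_efficient_attention(q, k, v, attn_bias=attn_bias)  # [1, total, 8, 64]
```

Because no tokens are padded, memory and compute scale with the actual token count rather than with `batch × max_len`.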
-
Hi, thanks for any suggestions.
The largest resolution I can use for training is 512 × 512, at a cost of ~76 GB of memory.
I set enable_xformers_memory_efficient_attention to True, but nothing ch…
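For reference, a minimal sketch of turning this on through the diffusers API (the checkpoint name is only an example); the call raises if xformers is missing or incompatible, which rules out a silent no-op:

```python
import torch
from diffusers import StableDiffusionPipeline

# Example checkpoint only; any diffusers pipeline with a UNet works the same way.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

pipe.enable_xformers_memory_efficient_attention()
pipe.unet.enable_gradient_checkpointing()  # stacks with xformers for further savings
```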