local-attention Search Results

1000+ results
for local-attention

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

e4exp/paper_manager_abstract #309

Scaling Local Self-Attention for Parameter Efficient Visual …

- https://arxiv.org/abs/2103.12731 - 2021 Self-attentionは、パラメータに依存しない受容野のスケーリングとコンテンツに依存した相互作用により、コンピュータビジョンシステムを改善することが期待されていますが、畳み込みのパラメータ依存のスケーリングとコンテンツに依存しない相互作用とは対照的です。自己注意モデルは、ResNet-50のよう…

e4exp updated 3 years ago
2
huggingface/transformers #33680

save_pretrained is changing the name of module when saving

### System Info - `transformers` version: 4.44.2 - Platform: macOS-15.1-arm64-arm-64bit - Python version: 3.10.14 - Huggingface_hub version: 0.23.3 - Safetensors version: 0.4.3 - Accelerate vers…

ZhiyuanChen updated 3 weeks ago
4
bghira/SimpleTuner #1090

Training flux lora on small images

I want to train flux lora on small text crops. And usually the size of these crops are small. So my multidatabackened.json looks like this [ { "id": "pseudo-camera-10k-flux", "type…

preethamp0197 updated 2 days ago
1
TransformerLensOrg/TransformerLens #737

[Bug Report] Q cannot be reshaped correctly when model is lo…

**Describe the bug** Query_input's shape is [batch, pos, n_heads, d_model], and the purpose of the code where the error occurred is to reshape query_input to [batch, pos, n_heads, d_head]. I found t…

po13on updated 2 weeks ago
4
jax-ml/jax #23349

`jax.nn.dot_product_attention` does not respect `key_value_s…

### Description Perhaps I am using this function incorrectly, but I get data leaks when using `key_value_seq_lengths`. It appears as though both the `xla` and `cudnn` implementations in jax nightly…

danjenson updated 1 month ago
5
axolotl-ai-cloud/axolotl #1905

Running Example on Free T4 GPU through Google Colab

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) didn't find any similar reports. ###…

hammad93 updated 2 weeks ago
8
unslothai/unsloth #1026

Unsloth & XFormers keep crashing on each other !

pip install "unsloth[cu121-ampere-torch240] @ git+https://github.com/unslothai/unsloth.git" pip install "unsloth[cu118-ampere-torch240] @ git+https://github.com/unslothai/unsloth.git" pip install "u…

thusinh1969 updated 1 month ago
4
CarperAI/trlx #601

OOM error with PEFT LoRA on Llama2-7B

### 🐛 Describe the bug I'm trying to finetune Llama2-7B (to reproduce the experiments in a paper) using PEFT LoRA (0.124% of trainable params). However, this results in an out-of-memory (OOM) error o…

arpaiva updated 1 month ago
1
microsoft/DeepSpeed-MII #472

Cannot run Yi-34B-Chat => ValueError: Unsupported q_ratio: 7

Hi DeepSpeed teams, Thank you for your great work! As the title suggests, the "01-ai/Yi-34B-Chat" model cannot run properly with DeepSpeed-MII version 0.2.3. The encountered error message is …

joeking11829 updated 2 months ago
3
MILVLG/bottom-up-attention.pytorch #117

error: command '/usr/local/cuda/bin/nvcc' failed with exit c…

Hello everyone, so i am trying to extract features from images for my project and getting this error again and again. I have successfully installed detectron2 and getting this error when trying to…

Abhayy-Kumar updated 4 days ago
3

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for local-attention

1000+ results
for local-attention