-
```
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
…
```
-
## Description:
Hello! I’ve been following the development of this repository and appreciate the efforts to benchmark various efficient Transformer variants. I’d like to propose the implementation of…
-
I downloaded `nvidia/Llama3-ChatQA-1.5-8B` manually from HF into a local directory and ran `scripts/convert_hf_checkpoint.py`. Then I wanted to run `generate.py` using the local checkpoint dir:
` raise RuntimeE…
-
I am writing to ask for help with an issue I've encountered while running "04-VelocityBasics" on my local machine. Upon executing the associated diagram, I noticed that the scatter plot…
-
I tried to imitate your educational coding style hehe
Here's a pure PyTorch implementation of Flash Attention, hope you like it @karpathy
```
def flash_attention(Q, K, V, is_causal=True, BLOCK_S…
```
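The tiled, online-softmax algorithm the snippet above refers to can be sketched in pure PyTorch as follows. This is a minimal illustrative version, not the poster's code: `block_size` and all variable names are my assumptions, and it only avoids materializing the full score matrix along the key axis (the memory savings of real fused kernels come from on-chip tiling, which plain PyTorch cannot express):

```python
import math
import torch

def flash_attention(Q, K, V, is_causal=True, block_size=64):
    """Block-wise attention over key/value tiles with online-softmax rescaling.
    Shapes: (batch, heads, seq, dim). Numerically matches standard attention."""
    B, H, S, D = Q.shape
    scale = 1.0 / math.sqrt(D)
    O = torch.zeros_like(Q)
    # Running row-wise max (m) and softmax normalizer (l), updated per tile.
    m = torch.full((B, H, S, 1), float("-inf"), device=Q.device, dtype=Q.dtype)
    l = torch.zeros(B, H, S, 1, device=Q.device, dtype=Q.dtype)
    for j in range(0, S, block_size):
        Kj = K[:, :, j:j + block_size]
        Vj = V[:, :, j:j + block_size]
        Sij = Q @ Kj.transpose(-1, -2) * scale            # (B, H, S, blk)
        if is_causal:
            q_idx = torch.arange(S, device=Q.device).unsqueeze(1)
            k_idx = torch.arange(j, j + Kj.shape[2], device=Q.device).unsqueeze(0)
            Sij = Sij.masked_fill(k_idx > q_idx, float("-inf"))
        m_new = torch.maximum(m, Sij.amax(dim=-1, keepdim=True))
        alpha = torch.exp(m - m_new)          # rescale stats from earlier tiles
        P = torch.exp(Sij - m_new)
        l = alpha * l + P.sum(dim=-1, keepdim=True)
        O = alpha * O + P @ Vj
        m = m_new
    return O / l
```

For small shapes this agrees with `torch.nn.functional.scaled_dot_product_attention` up to float32 round-off, which is a convenient correctness check.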
-
Hello!
The `main` (`a441a3f`) branch of the AQLM repository does not support `flash attention 2`. The error occurs because QuantizedWeight does not have a weight attribute ([closed issue #31](https…
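The failure mode can be reproduced in miniature: Flash Attention 2 code paths inspect `module.weight` (e.g. for its dtype), while AQLM-style quantized layers store packed codes instead of a dense weight tensor. The class below is a hypothetical stand-in for illustration, not the real `QuantizedWeight`:

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for an AQLM-style quantized layer: it holds packed
# integer codes rather than an nn.Parameter named `weight`.
class QuantizedLinearStandIn(nn.Module):
    def __init__(self, out_features, in_features):
        super().__init__()
        self.codes = torch.zeros(out_features, in_features, dtype=torch.int8)

layer = QuantizedLinearStandIn(8, 8)

# Any code path that assumes `layer.weight` exists fails here:
# nn.Module has no such parameter registered, so the attribute lookup raises.
has_weight = hasattr(layer, "weight")  # False
```

Until this is supported upstream, a workaround is to load the model with `attn_implementation="eager"`, a standard `from_pretrained` argument in recent `transformers` releases, so the Flash Attention 2 path is never entered.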
-
```
!pip install -U airllm
!pip install -U bitsandbytes
!pip install git+https://github.com/huggingface/transformers.git
!pip install git+https://github.com/huggingface/ac…
```
-
Hi,
Thank you for providing this collection! I'm trying to get local window attention to run. I managed to get a simple example running locally, as shown in #15, but I am now running into problems when …
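As a reference point for anyone debugging this, a local (sliding-window) attention mask can be built directly in PyTorch. The function below is an illustrative sketch, not the repository's implementation; in the causal case `window` counts the current position plus `window - 1` predecessors:

```python
import torch

def local_window_mask(seq_len, window, causal=True):
    """Boolean mask where True means "may attend".
    Causal: position i attends to j with i - window < j <= i.
    Bidirectional: position i attends to j with |i - j| < window."""
    i = torch.arange(seq_len).unsqueeze(1)  # query positions, column vector
    j = torch.arange(seq_len).unsqueeze(0)  # key positions, row vector
    if causal:
        return (j <= i) & (j > i - window)
    return (i - j).abs() < window
```

The resulting `(seq_len, seq_len)` mask can be passed (after converting `False` to `-inf` scores, or directly as `attn_mask`) to `torch.nn.functional.scaled_dot_product_attention`.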
-
Hi, thank you for your awesome work. However, when I tried to run the M3DClip model using the code on Hugging Face, I got some errors related to the einops lib. I noticed you use the monai ViT layers…
-
### System Info
```
pip install git+https://github.com/huggingface/transformers.git
pip install tokenizers==0.20.0
pip install accelerate==0.34.2
pip install git+https://github.com/huggingface/tr…
```