-
I noticed the following code in `MSCASpatialAttention`:
```python
def forward(self, x):
    """Forward function."""
    shorcut = x.clone()
    x = self.proj_1(x)
    …
```
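For reference, here is a minimal sketch of the residual pattern I believe this block implements; the layer names and types (`proj_1`, `activation`, `spatial_gating_unit`, `proj_2`) are my assumptions, not copied from the repository.

```python
import torch
import torch.nn as nn

class SpatialAttentionSketch(nn.Module):
    """Hypothetical sketch of the block around the quoted lines.

    The concrete layers are placeholders; only the shortcut-then-add
    structure is the point of this example.
    """

    def __init__(self, channels: int):
        super().__init__()
        self.proj_1 = nn.Conv2d(channels, channels, kernel_size=1)
        self.activation = nn.GELU()
        self.spatial_gating_unit = nn.Identity()  # placeholder for the attention/gating module
        self.proj_2 = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x):
        shortcut = x.clone()            # keep the input for the residual connection
        x = self.proj_1(x)
        x = self.activation(x)
        x = self.spatial_gating_unit(x)
        x = self.proj_2(x)
        return x + shortcut             # residual add back onto the saved input
```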
-
Config:
Windows 10 with an RTX 4090
All requirements installed, including the flash-attn build.
Server:
```
(venv) D:\PythonProjects\hertz-dev>python inference_server.py
Using device: cuda
Loaded tokeniz…
-
When I call `null_inversion.invert()`, the following error occurs:
```
Traceback (most recent call last): …
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question already answered in the FAQ? …
-
I'm not sure Issues is the best place to post this, but I just wanted to see if anyone else has been trying this idea:
There was [a paper that came out recently](https://arxiv.org/abs/2410.05258…
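For anyone curious, my understanding of the core idea in that paper (Differential Transformer) is that two separate softmax attention maps are computed and one is subtracted from the other to cancel attention noise. A minimal single-head sketch of that idea, where the function name and the fixed `lam` value are my own simplifications rather than the paper's code:

```python
import torch
import torch.nn.functional as F

def differential_attention(q1, k1, q2, k2, v, lam: float = 0.5):
    """Single-head sketch of differential attention.

    q1, k1, q2, k2: (batch, seq, d) tensors from two independent Q/K projections.
    v:              (batch, seq, d_v) values.
    lam:            a learnable scalar in the paper; a fixed float here for brevity.
    """
    d = q1.shape[-1]
    a1 = F.softmax(q1 @ k1.transpose(-1, -2) / d**0.5, dim=-1)  # first attention map
    a2 = F.softmax(q2 @ k2.transpose(-1, -2) / d**0.5, dim=-1)  # second attention map
    # The difference of the two maps weights the values, cancelling common-mode noise.
    return (a1 - lam * a2) @ v
```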
-
Hello,
I'm new to Transformers and Core ML. I converted the Llama-3.2-1B-Instruct model from
https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct to a Core ML model using
`python conver…
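In case it helps narrow down where things go wrong, here is a rough sketch of what I understand the conversion to be doing under the hood; the sequence length, dtype, and deployment target below are assumptions on my side, not values from the actual script, and the direct `torch.jit.trace` of the full model may need adjusting:

```python
import numpy as np
import torch
import coremltools as ct
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-1B-Instruct",
    torch_dtype=torch.float32,
    return_dict=False,   # tuple outputs are easier to trace than ModelOutput objects
)
model.eval()

seq_len = 64  # assumed fixed sequence length for the traced graph
example_ids = torch.zeros((1, seq_len), dtype=torch.int64)

# Trace the forward pass so coremltools can consume a TorchScript graph.
traced = torch.jit.trace(model, example_ids)

mlmodel = ct.convert(
    traced,
    inputs=[ct.TensorType(name="input_ids", shape=(1, seq_len), dtype=np.int32)],
    minimum_deployment_target=ct.target.iOS17,
)
mlmodel.save("Llama-3.2-1B-Instruct.mlpackage")
```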
-
### Motivation.
Take a look at the current Llama forward computation logic:
```python
class LlamaMLP(nn.Module):
def forward(self, x):
gate_up, _ = self.gate_up_proj(x)
x…
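        # For context (my own sketch, not the vLLM source): a SwiGLU-style MLP like this
        # usually proceeds roughly as
        #     gate_up = W_gate_up @ x       # one fused matmul producing [gate | up]
        #     hidden  = silu(gate) * up     # gated activation over the two halves
        #     out     = W_down @ hidden     # project back down to the hidden size
        # i.e. gate_up_proj fuses the gate and up projections into a single GEMM.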
-
```
Setting up MemoryEfficientCrossAttention. Query dim is 320, context_dim is None and using 5 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 320, context_dim is 1024 and using 5 heads.
…
```
-
### Your current environment
```text
The output of `python collect_env.py`
```
### How would you like to use vllm
I'm implementing a custom algorithm that requires a custom generate met…
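For context, a minimal sketch of the stock offline generation path in vLLM, which is the entry point I'd expect a custom generate method to wrap or replace; the model name and sampling values below are placeholders, not my actual setup:

```python
from vllm import LLM, SamplingParams

# Baseline offline generation with vLLM's public API; a custom generate method
# would presumably wrap or replace this call.
llm = LLM(model="meta-llama/Llama-3.2-1B-Instruct")
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=128)

outputs = llm.generate(["Explain speculative decoding in one sentence."], params)
for out in outputs:
    print(out.outputs[0].text)
```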
-
For example: https://0809zheng.github.io/2020/04/24/self-attention.html