-
### 🐛 Describe the bug
# Problem
When running compiled FlexAttention in a multi-GPU environment, if the device being used is not the first GPU (i.e., not `cuda` or `cuda:0`, but `cuda:1`, etc.), a…
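A minimal repro sketch of the scenario described (tensor shapes are assumptions, not from the report; requires a torch build with FlexAttention, i.e. 2.5+):
```python
import torch
from torch.nn.attention.flex_attention import flex_attention

# Assumed minimal repro: compile FlexAttention and run it on the second GPU.
device = torch.device("cuda:1")
q, k, v = (torch.randn(1, 8, 128, 64, device=device) for _ in range(3))

compiled_flex = torch.compile(flex_attention)
out = compiled_flex(q, k, v)  # per the report, this path breaks off cuda:0
print(out.shape, out.device)
```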
-
# ❓ Questions and Help
Hi, I tested memory_efficient_attention against the PyTorch-equivalent implementation in the docs, and found that they are not exactly the same. The code:
```python
def attention_e(self…
```
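For context, a sketch of this kind of comparison (shapes, dtype, and device are assumptions, not taken from the report), checking xformers' memory_efficient_attention against the reference math given in its documentation:
```python
import torch
import xformers.ops as xops

# xformers expects (batch, seq_len, heads, head_dim) inputs.
q = torch.randn(2, 128, 8, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

out_xf = xops.memory_efficient_attention(q, k, v)

def reference(q, k, v):
    # The "equivalent" implementation from the xformers docs, minus bias/dropout.
    q, k, v = (t.transpose(1, 2) for t in (q, k, v))  # -> (B, H, S, D)
    attn = ((q * q.shape[-1] ** -0.5) @ k.transpose(-2, -1)).softmax(-1)
    return (attn @ v).transpose(1, 2)

# Fused kernels accumulate in a different order, so bitwise equality is not
# expected; only a small fp16-level difference.
print((out_xf - reference(q, k, v)).abs().max())
```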
-
### 🐛 Describe the bug
Hi, I was testing FlexAttention by comparing its output with that of `nn.MultiheadAttention` and `torch.nn.functional.scaled_dot_product_attention`. In the end, I tracked down …
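A sketch of that kind of cross-check (tensor shapes are assumptions; with no score_mod, flex_attention should match plain SDPA up to numerics):
```python
import torch
import torch.nn.functional as F
from torch.nn.attention.flex_attention import flex_attention

q, k, v = (torch.randn(1, 8, 256, 64, device="cuda") for _ in range(3))

out_flex = flex_attention(q, k, v)                  # default: no score_mod
out_sdpa = F.scaled_dot_product_attention(q, k, v)

# A small numerical gap is normal; a large one points at the kind of
# discrepancy described above.
print((out_flex - out_sdpa).abs().max())
```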
-
### 🐛 Describe the bug
I attempted to train LLaVA (base LLM = LLaMA 3) using the Liger kernel. The loss graph was similar to when I trained LLaVA without the Liger kernel. However, the model traine…
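For reference, a sketch of how the Liger patch is typically enabled for a LLaMA-based model (the model id is a placeholder; this is not the reporter's training code):
```python
from liger_kernel.transformers import apply_liger_kernel_to_llama
from transformers import AutoModelForCausalLM

# Monkey-patches the HF LLaMA modeling code, so it must run before the
# model is instantiated.
apply_liger_kernel_to_llama()
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
```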
-
### Your current environment
```sh
python env.py
Collecting environment information...
PyTorch version: 2.6.0.dev20241011+rocm6.2
Is debug build: False
CUDA used to build PyTorch: N/A
ROCM us…
```
-
I was able to build flash-attention ROCm for both my MI100 and MI50 cards, but only got flash attention working on the MI100 (very impressive performance, I might add).
Trying to run flash attention …
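A quick import-and-run check (shapes and dtype are assumptions) that distinguishes "the wheel built" from "the kernel actually runs" on a given ROCm card:
```python
import torch
from flash_attn import flash_attn_func

# flash_attn_func takes (batch, seq_len, heads, head_dim) tensors.
q, k, v = (torch.randn(1, 128, 8, 64, device="cuda", dtype=torch.float16)
           for _ in range(3))
print(flash_attn_func(q, k, v).shape)  # raises on unsupported GPU archs
```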
-
### Information about bug
The Consolidated Financial Statement report shows the balances in company currency. There is a field that lets you set the currency for the report, so you may view certain acc…
-
## Description
Downloading the CRL referenced by a peer certificate.
## Background Information / Reproduction Steps
I have this use case:
1. UA Client and UA Server each have their own certificate, and e…
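As a sketch of what this use case relies on (the filename is hypothetical), the CRL download URL is usually taken from the peer certificate's CRL Distribution Points extension, e.g. with the `cryptography` package:
```python
from cryptography import x509
from cryptography.x509.oid import ExtensionOID

# Hypothetical example: read the CRL download URL(s) advertised by a peer
# certificate ("peer.pem" is a placeholder filename).
with open("peer.pem", "rb") as f:
    cert = x509.load_pem_x509_certificate(f.read())

cdp = cert.extensions.get_extension_for_oid(
    ExtensionOID.CRL_DISTRIBUTION_POINTS
).value
for dist_point in cdp:
    for name in dist_point.full_name or []:
        print(name.value)  # e.g. an http URL the CRL can be fetched from
```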
-
### System Info
- CPU architecture: x86_64
- GPU properties
  - GPU name: NVIDIA A100
  - GPU memory size: 40 GB
- Libraries
  - TensorRT-LLM branch or tag: v0.9.0
  -…
-
When I'm trying to use VideoCrafter2, I get this error:
F:\Pinokio\api\videocrafter2.git\app\env\lib\site-packages\torch\nn\functional.py:5560: UserWarning: 1Torch was not compiled with flash att…
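That warning comes from SDPA falling back because the torch build lacks the flash-attention kernel; a quick diagnostic sketch (an assumption about the setup, not part of the reporter's logs):
```python
import torch

# If the flash backend is unavailable in this build, SDPA silently falls
# back to the mem-efficient or math kernel and emits the warning above.
print(torch.backends.cuda.flash_sdp_enabled())
print(torch.backends.cuda.mem_efficient_sdp_enabled())
print(torch.cuda.is_available(), torch.version.cuda)
```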