-
### 🐛 Describe the bug
I'm compiling a graph multiple times using inductor. I find that it modifies the graph in place, and one of the graph's outputs changes from a tensor to a list of tensors.
Example code:
…
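A minimal sketch of the double-compilation pattern (hypothetical, since the original example code is elided; `torch._inductor.compile_fx.compile_fx` is an internal entry point, used here only to hand the same captured graph to inductor twice):
```python
import torch
from torch._inductor.compile_fx import compile_fx  # internal inductor entry point

captured = []

def inductor_backend(gm, example_inputs):
    # Stash the dynamo-produced fx.GraphModule before handing it to inductor.
    captured.append(gm)
    return compile_fx(gm, example_inputs)

@torch.compile(backend=inductor_backend)
def f(x):
    return x.sin() + x.cos()

x = torch.randn(8)
f(x)  # first compilation

gm = captured[0]
print(gm.graph)        # graph after the first inductor compile
compile_fx(gm, [x])    # compile the same captured graph a second time
print(gm.graph)        # any in-place mutation of the graph shows up here
```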
-
### 🐛 Describe the bug
As mentioned in this [blog](https://dev-discuss.pytorch.org/t/higher-order-operators-2023-10/1565), HigherOrderOperator does not support graph break inside the input/output fun…
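For context, a minimal sketch of what a graph break inside a HigherOrderOperator body looks like, assuming `torch.cond` as the operator and a `print` call as the source of the graph break (an illustrative example, not taken from the post):
```python
import torch


def true_fn(x):
    # print() forces a graph break, which HigherOrderOperators such as
    # torch.cond do not allow inside their branch functions.
    print("taking the true branch")
    return x.sin()


def false_fn(x):
    return x.cos()


@torch.compile(fullgraph=True)
def f(pred, x):
    return torch.cond(pred, true_fn, false_fn, (x,))


x = torch.randn(4)
# Expected to raise an error because the branch body cannot be captured
# as a single graph.
f(torch.tensor(True), x)
```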
-
The possibility of inserting a watermark into the widget is very interesting, especially when we need to draw the user's attention to a piece of information.
In the example below, we have a sensor th…
-
### 🐛 Describe the bug
The model is converted with dynamo and opset 18, using torch nightly and the most recent onnx and onnxscript.
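A minimal sketch of the export call this likely corresponds to (an assumption; the model and exact arguments are placeholders), using `torch.onnx.export` on the dynamo path with opset 18:
```python
import torch

# Hypothetical stand-in for the model being converted.
model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.ReLU()).eval()
example_inputs = (torch.randn(2, 16),)

# Dynamo-based exporter; requires a recent torch plus onnx and onnxscript.
onnx_program = torch.onnx.export(
    model,
    example_inputs,
    dynamo=True,
    opset_version=18,
)
onnx_program.save("model.onnx")
```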
# PyTorch ONNX Conversion Report
```
✅ Obtain model graph with `torch…
-
# Summary
We recently landed support for grouped query attention via `enable_gqa` on SDPA; however, this is only enabled on the flash attention backend. This leads to a weird situation where it c…
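A minimal sketch of what enabling GQA on SDPA looks like (illustrative shapes; the specific failure mode is elided above), with more query heads than key/value heads:
```python
import torch
import torch.nn.functional as F

# Grouped-query attention: 8 query heads share 2 key/value heads.
q = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn(1, 2, 128, 64, device="cuda", dtype=torch.bfloat16)
v = torch.randn(1, 2, 128, 64, device="cuda", dtype=torch.bfloat16)

# enable_gqa=True broadcasts the 2 KV heads across the 8 query heads.
# As noted above, only the flash attention backend currently supports this.
out = F.scaled_dot_product_attention(q, k, v, enable_gqa=True)
print(out.shape)  # torch.Size([1, 8, 128, 64])
```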
-
I tried exporting the stream conformer model to ONNX format with the parameters below.
```
python3 wenet/bin/export_onnx_gpu.py --config=$model_dir/train.yaml --checkpoint=$model_dir/final.pt --cmvn_fi…
-
## Summary
Repro script
```python
import torch
import torch.nn as nn
import torch.nn.functional as F
q = torch.randn(1, 16, 1, 64, device="cuda", dtype=torch.bfloat16, requires_grad=True)…
-
Hi, I'm working on the attention mechanism for face recognition models. I'm using the IR model as a backbone, but I don't know much about the details of the Grad-CAM implementation: what exactly sh…
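For reference, a minimal Grad-CAM sketch using plain forward/backward hooks; the `resnet18` backbone and `layer4` target layer are placeholders, not the actual face-recognition model:
```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18  # stand-in for the IR backbone

model = resnet18(weights=None).eval()
activations, gradients = {}, {}

def fwd_hook(_, __, output):
    activations["feat"] = output

def bwd_hook(_, grad_input, grad_output):
    gradients["feat"] = grad_output[0]

# Placeholder target layer; for a real backbone, pick the last conv block.
layer = model.layer4
layer.register_forward_hook(fwd_hook)
layer.register_full_backward_hook(bwd_hook)

x = torch.randn(1, 3, 224, 224, requires_grad=True)
score = model(x)[0].max()        # score to explain (e.g. an identity logit)
score.backward()

# Channel-wise weights from the gradients, then a weighted sum of activations.
weights = gradients["feat"].mean(dim=(2, 3), keepdim=True)
cam = F.relu((weights * activations["feat"]).sum(dim=1, keepdim=True))
cam = F.interpolate(cam, size=x.shape[-2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
print(cam.shape)  # (1, 1, 224, 224) heatmap over the input
```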
-
Hi, thanks for creating this package; it helps us run Whisper with TensorRT.
However, we found that this package didn't include a dependency map (usually provided via requirements.txt),
so we run wh…
-
From my understanding, flex attention (using `block_mask`) gets faster when the number of empty blocks is larger. If the inputs (Q, K, V) do not represent sequences, but graphs with local connectivity…
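A minimal sketch of a local-connectivity block mask with flex attention (hypothetical window size and shapes), where most blocks away from the diagonal are empty and can be skipped:
```python
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

SEQ_LEN, WINDOW = 4096, 128

def local_mask(b, h, q_idx, kv_idx):
    # Each query attends only to keys within a fixed local window, so blocks
    # far from the diagonal are entirely empty and get skipped.
    return (q_idx - kv_idx).abs() <= WINDOW

block_mask = create_block_mask(local_mask, B=None, H=None,
                               Q_LEN=SEQ_LEN, KV_LEN=SEQ_LEN)

q = torch.randn(1, 8, SEQ_LEN, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn(1, 8, SEQ_LEN, 64, device="cuda", dtype=torch.bfloat16)
v = torch.randn(1, 8, SEQ_LEN, 64, device="cuda", dtype=torch.bfloat16)

out = flex_attention(q, k, v, block_mask=block_mask)
print(out.shape)  # (1, 8, 4096, 64)
```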