-
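Hello! I have some questions about how the attention scores are computed when the user representation is updated using the social network and the interest network.
First,
from the code above, gama^(k+1)_(a1) = 1/2 * self.consumed_items_attention and gama^(k+1)_(a2) = 1/2 * self.social_neighbors_attention. gama^(k+1)_(a1) and gama^(k+1)_(a2)…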
-
**Is your feature request related to a problem?**
This is related to compiling ONNX models for upload to OpenSearch.
There are two problems with the status quo:
1. `transformers.convert_graph_to_onnx.conver…
-
Dear author:
Hello,
I am running your publication 'Bilateral Cross Modality Graph Matching Attention'
There were some errors in the source code of the paper 'For Feature Fusion in Visual Question A…
-
I'm using torch 2.1.0.dev20230425+cpu and diffusers 0.16 to build Stable Diffusion v1_5, but I got the following error:
`assert (
AssertionError: Unsupported function type scaled_dot_product_atte…
-
### 🐛 Describe the bug
# reproduce the bug
@mstebelev found that the memory-efficient attention kernel on float32 CUDA tensors gives NaN gradients even though the inputs and incoming gradient are reaso…
-
### Current Behavior:
Dependency-Track v4.5.0 introduced support for [EPSS](https://www.first.org/epss/model). This is currently provided via the "Exploit Predictions" tab in each project.
The s…
-
### System Info
- TensorRT-LLM version: 0.10.0.dev2024050700
(I doubt any other information is relevant)
### Who can help?
@kaiyux
### Information
- [ ] The official example scripts
- […
-
### 是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
- [x] 我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions
### 该问题是否在FAQ中有解答? | Is there an existing ans…
-
Hello, is there any reference code available for this paper?
-
### Describe the issue
With the following config, the optimization fails with an assertion error on `num_heads>0`:
"transformer_optimization": {
"model_type": "bert",
"opt_level": 0,
…