-
### params:
batch_size: 64, num_heads: 16, seq_len: 2048, dimension: 64, type:torch.float16, use mask, A800
### example:
```Python
def func(y, grad):
y.backward(grad, retain_graph=True)
…
Uwwal updated
1 month ago
-
### Describe the bug
GAT tries to uise subclassing to share tests between dense and sparse models, but accidentally defines a function instead of a subclass:
https://github.com/stellargraph/stel…
huonw updated
4 years ago
-
Hello,
Thanks for your great work.
In your paper, I can see the results of s3dis experiments but I cannot find the codes for it.
Could you provide the codes(prepare data, configs, dataset, etc ....…
-
The avatar icon with presence displays green when the presence is "Available Idle" and "Inactive" where MS Teams and the other MS apps display the yellow away icon. It corrects once the presence chan…
-
Dear developers,
I hope this message finds you well. Firstly, I would like to express my appreciation for your excellent work on the Soot-FlowDroid module. It has been instrumental in my recent ana…
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [ Yes] I am running the latest code. Development is very rapid so there are no tagged versions as…
-
Can we train Graphormer to generate graphs, like what the GPT models do in text generation? Can we apply attention masks similar to the GPT models?
-
Hi, are we able to use it together with IPAdapter & Controlnet from Xlabs?
-
### Description
Run
```
import jax
import jax.numpy as jnp
dtype = jnp.bfloat16
batch_dim = 16
x_shape = (batch_dim, 512, 16, 48)
bias_shape = (batch_dim, 16, 512, 512)
mask_shape = (1, 1, …
-
# Provide required information needed to triage your issue
## Your Environment
* Platform [PC desktop, Mac, iOS, Office on the web]: Office on the web (on premises)
* Host [Excel, Word, PowerPo…