-
Traceback (most recent call last):
File "train.py", line 527, in
training(lp.extract(args), op.extract(args), pp.extract(args), dataset, args.test_iterations, args.save_iterations, args.chec…
-
Not really an issue, but more of a question / idea.
At my work we handle quite a lot of images, and have therefore looked at a few solutions to improve perceived loading speed for them. I don't kno…
-
Hello,
Thanks for a very useful and well documented package. Great effort!
I have tried to train the model with flash attention enabled, and ran into the following error:
```
....
File "/…
-
## Description
Hello, there are a lot of functions from torch package not implemented (wrapped) in DJL.
For example batch matrix-matrix product operation: TORCH.BMM
It's looks like it should be w…
-
在下面的代码中, 我觉得应该表明为什么 Q, K, V 向量序列是等于 inputs_embeds 的, 我理解的是注意力机制中的 QKV 是 embedding 与 W_Q 和 W_K , W_V 这三个矩阵相乘得到的, 这三个矩阵也是超参数, 而下面的代码是好像默认 这三个矩阵是单位矩阵.
`import torch
from math import sqrt
Q = K = V…
-
After blind merged-mining some sidechain blocks, the blocks are sometimes then orphaned later - in one case (and possibly all cases) after I turn off automated BMM. The block height continues to incre…
-
@wolandscat In [basic types bmm](https://raw.githubusercontent.com/openEHR/reference-models/master/models/openEHR/Release-1.0.2/BMM/openehr_basic_types_102.bmm), mapping property of DV_TEXT is define…
-
begin to convert:
Warning: Encountered known unsupported method torch.bmm
[TensorRT] ERROR: INVALID_ARGUMENT: Cannot find binding of given name: input_0
tensor(2.24113e-05, device='cuda:0', grad_f…
-
The common/script/bmm_gen.py script fails to run with “inconsistent use of tabs and spaces in indentation” on python 3.6.5.
-
I try to realise the FA loss after your answers。But I met some questions in relation graph 。
my test code is
x = np.random.random((256, 64, 64))
y = np.random.random((256, 64, 64))
y = torch.from…