-
In model.py, have you applied the softmax function already? or Do I need to implement it? After training the model, it's giving Don't know, or I am sorry for most of the questions. Could attention mec…
-
backward implementation softmax
ypred - y
-
If you test Softmax instead scaling by 1/N
-
2. Ratio of `Triton`/ `XeTLA` keep same except for attention caused by `XeTLA` attention absolute number degraded
3. Both `Triton` and `XeTLA` softmax cases degraded, so `Triton`/ `XeTLA` not changed…
-
Task Q3 from cs231n course (https://cs231n.github.io/assignments2024/assignment1/)
Task formulation and "skeleton" of the solution are located here:
https://github.com/bulygin1985/ML_CV_study/blob/m…
-
你好,MV-softmax中 cos_theta[mask] = self.mv_weight * hard_example + self.mv_weight - 1. 与论文的实现好像有点差异,怎么理解?
-
Hello,
Thank you for sharing your code, it has been incredibly useful!
I am currently trying to use your MAPPO or HAPPO to run my tasks where my actions are n-dimensional continuous actions. The…
-
C:\Users\droly\Paints-UNDO>python gradio_app.py
A matching Triton is not available, some optimizations will not be enabled
Traceback (most recent call last):
File "C:\Users\droly\miniconda3\envs…
-
### Description
Brought up in #846
```python
import pytensor
import pytensor.tensor as pt
x = pt.vector("x", dtype="int64")
out = pt.special.softmax(x)
# Doesn't seem right
out.dprint…
-
Hi, there are four categories of labels in my dataset.json, and now the prediction is 4 labels per patient in one nii.gz file, if I want to output a specific nii.gz file for each patient's label in th…