-
## Feature request
Request the implementation of the following ONNX operators:
* LogSoftmax
* Softmax
* ReduceMax
## Motivation
These operators are common in neural networks of many types;…
-
Is it possible to use this library to implement a network with a Softmax output layer? I was surprised to find this layer type neither implemented nor mentioned in the docs, since it's so common.
-
![image](https://github.com/user-attachments/assets/524f9693-a188-4246-85ed-f29ec5b4b9c6)
Please note in the [Transformer Example](https://ltrujello.github.io/Tikz-Python/examples/) the right space…
-
Hi,
Could you please point out where the gumbel softmax is applied in the code? I am unable to find it.
Thanks
-
My implementation of softmax.backward is below:
def backward(self, label):
self.delta = self.softmax.copy()
### start your code
self.delta -= label
## end your cod…
-
### Issue type
Feature Request
### Have you reproduced the bug with TensorFlow Nightly?
No
### Source
binary
### TensorFlow version
unknown 2.12.0
### Custom code
No
### …
-
There has been some community appetite for classification tasks #1249 #1124. Incidentally, due to the use of classification models for RLHF, we already have some of the necessary components to support…
-
I solve the long softmax puzzels, but I have to store the intermediate results to z_ptr, which may cause unnecessary Memory I/O.
Essentially, I would like to know if there's a solution to create te…
-
Test the implementation as the second model in the report.
-
I will write some improvement for the softmax implementation as CCW, here is my plan:
- [ ] Allow to have n-dimensional tensor as input where the softmax operation is applied on the last dimension.…