-
Dear keras2onnx maintainers,
I am trying to convert a Keras model to ONNX format using your library, but I am having trouble. I first tried with my model, then I tried just running the example noteb…
-
Context: I am trying to run distributed training on 2 A100 GPUs with 40 GB of VRAM. The batch size is 3 and gradient accumulation is 1. I have attached the config file below for more details and the er…
-
OS: Windows 11
Python == 3.11.0 64 bit
Keras == 3.0.5
---------
```python
import time
import os
from math import floor
import tensorflow as tf
tf.config.experimental.enable_op_determinism()
…
-
Thank you for this wonderful example, which helped me understand the gradient descent implementation.
I just noticed a minor mistake:
- dW_curr = np.dot(dZ_curr, A_prev.T) / m
- db_curr = np…
-
I was trying to create this model, but I ran into some errors. Could you have a look?
```
`--------------------------------------------------------------------------
TypeError …
-
As in the title, the adversarial losses don't change at all from 1.398 and 0.693 respectively after roughly epoch 2 until the end, though G_l2_loss does change. Any ideas what's wrong? I've tried changing…
-
Thanks for the great work.
When I use the Sigmoid activation function, it raises a NotImplementedError: "Function `bound_forward` of `BoundSigmoid(name="/18")` is not supported yet"
I think …
-
Using Version 0.11.1
I wanted to modify a particular head in order to change some calculations to fulfill the agent requirements, and found that you cannot instantiate the new head if it doesn't liv…
-
The derivative of sigmoid is very small when the scores are away from zero, which is why sigmoid activation has been all but abandoned in deep learning. In the original GAN, the logarithm of sigmoid is use…
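
To illustrate the claim about the sigmoid derivative, here is a minimal sketch using only the standard library. The helper names `sigmoid` and `sigmoid_grad` and the sample scores are illustrative assumptions, not taken from any particular codebase; the sketch relies on the identity sigmoid'(x) = sigmoid(x) * (1 - sigmoid(x)).

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid; fine for the moderate inputs used here."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(x: float) -> float:
    """Derivative of the sigmoid: s * (1 - s), where s = sigmoid(x)."""
    s = sigmoid(x)
    return s * (1.0 - s)


# Hypothetical scores moving away from zero.
scores = [0.0, 2.0, 5.0, 10.0]
grads = [sigmoid_grad(x) for x in scores]

for x, g in zip(scores, grads):
    print(f"score={x:5.1f}  sigmoid'={g:.6f}")
```

The derivative peaks at 0.25 for a score of zero and shrinks rapidly as the score grows in magnitude, which is the saturation effect described above.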
-
Hi, I have some questions about the training speed when using multiple GPUs with MXNet as the backend for Keras. According to https://mxnet.incubator.apache.org/how_to/multi_devices.html, which says "By …