-
I had run pretty much this exact command a couple weeks ago when doing benchmarking but now it is failing with a stride mismatch error. Creating this issue so others can take a look as well. Repro and…
-
Great work!Can this method be used for regression tasks where the loss function is L1
-
Hi there,
I'm currently working on improving the numerical stability of mixed precision training by proposing a new loss scaling method. It is called adaptive loss scaling, which calculates a prope…
-
## Related Reference
- Goodfellow I, Bengio Y, Courville A. [Deep learning]().
- Bishop C M, Nasrabadi N M. [Pattern recognition and machine learning]().
- Murphy K P. [Machine learning: a proba…
-
I am using tensorflow 2 model along with shap 0.39. I can not get any results due to following error.
LookupError: gradient registry has no entry for: shap_LeakyRelu
Not sure what to do since i ne…
-
Using TensorFlow 2.4 with GPUs
**Describe the current behavior**
//tensorflow/python/keras/integration_test:gradients_test fails when run on a node with GPUs. I traced the problem to inexact com…
-
need to do the following:
1. derive the equations behind the model
2. implement the equations and ensure strong results
-
EDITED
-
I am trying to use the framework to continue pretraining llama3-8B. I have converted the HF checkpoint into nanotron format and the generated tokens seem reasonable.
I use the following setting to…
-
Hi,
I just stuck with exception which doesn't allow me to make my learning correct.
```
org.nd4j.linalg.exception.ND4JIllegalStateException: X, Y and Z arguments should have the same length for Pai…