-
Add description for gradient descent: batch gradient descent and stochastic gradient descent.
Optionally, also add details about mini-batch GD.
-
In a series simple tensorflow programs I obtain memory leaks (unbounded growth of CPU memory).
On original program on a computer with 64GB of RAM this leak is about 640 megabytes per hour (1% of tota…
-
Hi~ I tried to implement GCT on detectron2 to trian Pascal VOC in terms of object detectrion but performance isn't going up. I notice that you said in the paper: "All the backbone models are pre-trai…
-
I've got a question for the discriminator loss.
It seems when training using WGAN you can end up with increased image quality with increased loss.
I have plotted here -log D vs. generator iterat…
-
I think Optim.jl is in a prime position to generalize itself as a common interface for nonlinear optimization. While JuMP has some support for nonlinear optimization, by design it won't be able to be …
-
Hi,
### (1) Update guide to support newest version
I'm going through the "Next Steps" chapter > section "Switching to pre-trained contextualizers".
First, the config file showed in the guide uses :…
-
I'm sorry to bother you.I have run your code several times, mAP only reaches 8%, and top-1 only reaches 20% at most. Otherwise,the results are different when each time I run the code. How can I achiev…
-
I suggest to improve the English used for the documentation in the following:
> copy_initial_weights – if true, the weights of the patched module are copied to form the initial weights of the patch…
-
In Chapter 4 - Exercise: Batch Gradient Descent with early stopping for Softmax Regression, the math equations are not getting displayed correctly. I checked in Chrome, Edge and Firefox.
$J(\mathb…
-
Please make sure that this is a bug. As per our [GitHub Policy](https://github.com/tensorflow/tensorflow/blob/master/ISSUES.md), we only address code/doc bugs, performance issues, feature requests and…