gradients Search Results

1000+ results for gradients

google/brax #328

assert_is_replicated in Analytic policy gradients training

When I try to use a 4-GPU machine to run the Analytic policy gradients training in parallel, it reports an AssertionError in `brax/training/agents/apg/train.py` line 255. Seems that it is because `t…

wangyian-me updated 1 year ago
8
TuringLang/DistributionsAD.jl #123

Making rand gradients work for more distributions

Trying to compute gradients of the `rand` function with respect to parameters for certain distributions will produce incorrect results, because some of these functions use branching or iterated algorithms and …

dcjones updated 2 years ago
7
tensorlayer/SRGAN #210

Gradients do not exist for these variables

WARNING:tensorflow:Gradients do not exist for variables ['batchnorm2d_1/moving_mean:0', 'batchnorm2d_1/moving_var:0', 'batchnorm2d_2/moving_mean:0', 'batchnorm2d_2/moving_var:0', 'batchnorm2d_3/moving…

adithyak-hub updated 2 years ago
4
genshinsim/gcsim #545

Substat Optimizer - Std Dev Optimization

Per convo in https://github.com/genshinsim/gcsim/pull/483 Currently we build substat gradients based on avg damage, then we interpret avg dmg gradients as our only consideration for whether any par…

jordanlovato updated 5 months ago
3
Jannoshh/simple-sam #5

Epsilon value of 1e-12 too small for mixed precision

The epsilon value of 1e-12 used in the following lines for the `first_step` and `sam_train_step` functions is too low and can cause NaN errors when training with mixed precision: `e_w = gradients[i] …

Avelina9X updated 2 years ago
1
pytorch/pytorch #22024

Consolidate definition of operators/gradients where possible

In working on #21088, there were cases where code changes needed to be made that were repetitive and could be error-prone. We could probably simplify/merge some of this code. To modify an operator …

nairbv updated 3 years ago
1
TuringLang/DistributionsAD.jl #118

Gradients of logpdf with TuringDiagMvNormal return nothing

This puzzles me a bit ```julia using DistributionsAD, Distributions, Flux using DistributionsAD: TuringDiagMvNormal Flux.@functor TuringDiagMvNormal m = [1.0] S = [0.1] f = TuringDiagMvNo…

nmheim updated 3 years ago
3
pytorch/pytorch #33081

DataParallel gives different gradients when using LSTMs

## 🐛 Bug When using DataParallel on a model with LSTMs the losses obtained compared to the same model run on a single GPU are different. ## To Reproduce Here is a sample code block that seed…

siddheshk updated 4 years ago
1
pytorch/pytorch #12795

C++ frontend: how to debug nan gradients

Hi, I am getting very large gradients and then, even with clamping, nan gradients (suddenly all of them). I am surprised because I am porting my working program from Python to C++ backends. How to de…

slaweku updated 4 years ago
3
pjreddie/darknet #606

Why update gradients of bbox like this?

@pjreddie Hi, I want to know why update gradients of bbox like the below? Why the scale gradient of bbox is "(2-truth.w*truth.h)" ? delta_yolo_box(truth, l.output, l.biases, best_n, box_index, i,…

ersanliqiao updated 5 years ago
1
