grads Search Results - Githubissues

1000+ results
for grads

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

FluxML/Zygote.jl #991

gradients with aliased variables

I was trying to figure out how to properly handle and update Flux's layers with tied weights ( https://github.com/FluxML/Flux.jl/issues/1592). So first of all I wanted to check how Zygote handles a…

CarloLucibello updated 2 years ago
9
PaddlePaddle/Paddle #56407

单测系统test/legacy_test/eager_op_test.py似乎未对低精度算子执行有效测试

### bug描述 Describe the Bug ### 问题复现： 1. 准备一段本不能通过测试的示例此处采用[#53078/commits/modify the test_lerp_op.py](https://github.com/PaddlePaddle/Paddle/pull/53078/commits/518866d67f79b9b1aedc3135fe669881c9a…

Difers updated 1 year ago
3
wojciechmo/deep-compression #1

Gradient Modification

``` gradients_vars = optimizer.compute_gradients(loss, LAYERS_WIEGHTS) grads = [grad for grad, var in gradients_vars] train_step = optimizer.apply_gradients(gradients_vars) ``` Hi, in this code, …

Site1997 updated 5 years ago
1
pytorch/pytorch #115785

arctan2 fp16 error when optimising

### 🐛 Describe the bug I can't narrow it down further, but torch.arctan2 seemingly calculates the correct gradients and optimises correctly for fp32,fp64 and bfloat16, but for some reason, the versio…

nicholas-greig updated 9 months ago
3
pytorch/pytorch #133554

[Optim][Dynamo] Tensor unproperly assigned in Adagrad optimi…

### 🐛 Describe the bug I am testing the function of optimizer using torch dynamo, I found that there is a small problem in Adagrad, **state["step"]** was assigned to CPU while other parameters are …

kkie02 updated 1 month ago
4
tensorflow/tensorflow #61285

ValueError: No gradients provided for any variable

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? Yes ### Source source ### TensorFlow version tf 2.12 ### Custom code Yes ### OS platform and d…

innat updated 1 year ago
4
google/brax #290

While loop in generalized/math forces to use grads Forward m…

https://github.com/google/brax/blob/280a1c50fa021b6c17a2a3347fea43a2887382bc/brax/v2/math.py#L278 For people who want to calculate gradients over environment steps, this while loop is a bit annoyin…

gijskoning updated 1 year ago
1
NVIDIA/Megatron-LM #716

[BUG] Permormance drop while training with MoE

**Describe the bug** During our training sessions utilizing Megatron's Mixture of Experts (MoE) layers, we observed a decline in performance occurring at specific steps, with this deterioration manif…

Teng-xu updated 2 months ago
8
mir-group/pair_nequip #15

❓ [QUESTION] Error while using potential in lammps. RuntimeE…

Lammps runs and terminate after sometim ``` terminate called after throwing an instance of 'std::runtime_error' what(): The following operation failed in the TorchScript interpreter. Tracebac…

n0w0f updated 1 year ago
21
pytorch/pytorch #64093

test_backward_accumulate_grads (__main__.TensorPipeDistAutog…

https://app.circleci.com/pipelines/github/pytorch/pytorch/371361/workflows/a14cd549-a37b-47ce-ad6a-a8baa3d40a54/jobs/15666994/steps ``` Aug 26 22:20:15 test_backward_accumulate_grads (__main__.T…

mrshenli updated 2 years ago
2

上一页 1...16 17 18 19 20 21 22...100 下一页

1000+ results for grads

1000+ results
for grads