-
### Subject of the issue
Calling **compute_density_BMTI** with **delta_F_inv_cov="LSDI"** raises an error and crashes.
### Your environment
Colab notebook, Python 3.10…
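A minimal sketch of a reproduction, assuming the dadapy `Data` interface; the synthetic dataset and preprocessing steps are my guesses, not from the original report:
```python
import numpy as np
from dadapy import Data

# Hypothetical repro: random 3D points stand in for the real dataset.
X = np.random.default_rng(0).normal(size=(500, 3))
data = Data(X)
data.compute_distances(maxk=100)
data.compute_id_2NN()  # BMTI needs the intrinsic dimension first
data.compute_density_BMTI(delta_F_inv_cov="LSDI")  # this call crashes per the report
```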
-
In fact it looks like we may silently ignore them.
The ctx methods in question are:
- ctx.mark_dirty()
- ctx.mark_non_differentiable()
- ctx.set_materialize_grads()
Repro:
```
import torch
…
```
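For context, here is a self-contained sketch (my illustration, not the original repro) of a custom `autograd.Function` that exercises all three ctx methods from the list above:
```python
import torch

class InplaceReluWithArgmax(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.set_materialize_grads(False)  # backward may then receive None grads
        x.relu_()                         # in-place modification of the input
        ctx.mark_dirty(x)                 # declare the in-place change to autograd
        idx = x.argmax()
        ctx.mark_non_differentiable(idx)  # integer output gets no gradient
        ctx.save_for_backward(x)
        return x, idx

    @staticmethod
    def backward(ctx, grad_x, grad_idx):
        if grad_x is None:                # possible because materialize_grads is off
            return None
        (x,) = ctx.saved_tensors
        return grad_x * (x > 0)

base = torch.randn(5, requires_grad=True)
y, idx = InplaceReluWithArgmax.apply(base.clone())  # clone: leaves can't be mutated in place
y.sum().backward()
print(base.grad, idx.requires_grad)  # grads flow to base; idx.requires_grad is False
```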
-
I am trying to use L-BFGS and related optimizers with nnx + optax, but I'm running into trouble. It might be that `optax` has a slightly different optimization interface in those cases: https://optax.rea…
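For what it's worth, recent optax versions expose `optax.lbfgs()`, whose `update` expects extra `value`, `grad`, and `value_fn` keyword arguments for its line search; I suspect that interface difference is what bites here. A standalone optax-only sketch, without nnx (the toy loss is just for illustration):
```python
import jax
import jax.numpy as jnp
import optax

def loss_fn(params):
    return jnp.sum((params - 3.0) ** 2)  # toy quadratic, for illustration only

params = jnp.zeros(4)
opt = optax.lbfgs()
state = opt.init(params)

for _ in range(10):
    value, grads = jax.value_and_grad(loss_fn)(params)
    # Unlike first-order optimizers, lbfgs' update also wants the loss value,
    # the gradient, and the loss function itself for its line search.
    updates, state = opt.update(grads, state, params,
                                value=value, grad=grads, value_fn=loss_fn)
    params = optax.apply_updates(params, updates)
```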
-
I get the following error when using gradient checkpointing with PEFT LoRA training.
> NotImplementedError
> self.get_input_embeddings()
```
Traceback (most recent call last):
File "/home…
-
I would like to implement the grokfast algorithm (an exponentially weighted moving average of past gradients added to the current gradients) with GradCache. I've been able to use it without GradCac…
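For reference, a standalone sketch of the grokfast-style EMA filter I mean, without the GradCache integration (the names follow the grokfast reference implementation; it is applied between `loss.backward()` and `optimizer.step()`):
```python
import torch

def gradfilter_ema(model, grads=None, alpha=0.98, lamb=2.0):
    # grads holds the running EMA of past gradients, keyed by parameter name.
    if grads is None:
        grads = {n: p.grad.detach().clone()
                 for n, p in model.named_parameters() if p.grad is not None}
    for n, p in model.named_parameters():
        if p.grad is None:
            continue
        grads[n] = grads[n] * alpha + p.grad.detach() * (1 - alpha)
        p.grad = p.grad + grads[n] * lamb  # amplify the slow gradient component
    return grads
```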
-
I get an error during stage 2.
```
Traceback (most recent call last):
File "/mnt/petrelfs/wangzhao/HumanT2V/AnimateAnyone/train_svd.py", line 803, in
main(config)
File "/mnt/petrelfs/wangz…
-
### 🐛 Describe the bug
Hi!
I'm trying to backpropagate through my model along multiple directions, so I'm using `torch.autograd.grad` with `is_grads_batched=True`. I had no problem using it on an MLP, but wh…
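Here is the kind of MLP usage that works for me, for comparison (toy shapes, just to show the batched-`grad_outputs` convention):
```python
import torch

mlp = torch.nn.Sequential(torch.nn.Linear(4, 8), torch.nn.Tanh(),
                          torch.nn.Linear(8, 3))
x = torch.randn(1, 4, requires_grad=True)
y = mlp(x)

# One direction per row; each slice along dim 0 must match y's shape (1, 3).
directions = torch.eye(3).unsqueeze(1)  # shape (3, 1, 3)
grads, = torch.autograd.grad(y, x, grad_outputs=directions,
                             is_grads_batched=True)
print(grads.shape)  # (3, 1, 4): one input gradient per direction
```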
-
Thank you for this package. I'm looking for an example of how to implement a simple MLP (multilayer perceptron) with this package. Any code snippets or tutorials are welcome.
Below is some code th…
-
## 🐛 Bug
The following test fails for DDP:
```
@unittest.skipIf(BACKEND != 'nccl' and BACKEND != 'gloo',
"Only Nccl & Gloo backend support DistributedDataParallel")
…
-
I am a bit confused by the ensure_shared_grads here: https://github.com/ikostrikov/pytorch-a3c/blob/master/train.py#L13. Here, the `grad` is synced only when it is `None`. I think we need to set `sha…
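For reference, the linked function reads roughly like this (quoted from memory of the repo, so treat it as approximate):
```python
def ensure_shared_grads(model, shared_model):
    for param, shared_param in zip(model.parameters(),
                                   shared_model.parameters()):
        if shared_param.grad is not None:
            return  # grads already point at this worker's tensors
        shared_param._grad = param.grad
```
As I read it, the early return assumes that after the first call in a worker, `shared_param.grad` already aliases that worker's `param.grad` tensor, so later backward passes are picked up without re-assigning.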