-
In the documentation section for Head gradient and the chain rule, I think it might be better to explain the context behind head gradient in a bit more detailed way.
Like if we refer to the class-no…
-
### System Info
- `transformers` version: 4.43.3
- Platform: Linux-5.15.0-46-generic-x86_64-with-glibc2.31
- Python version: 3.9.19
- Huggingface_hub version: 0.24.2
- Safetensors version: 0.4.3
…
-
This probably isn't an issue for this SDK, I just added this as a "discussion"...
I've been messing around with getting Neural Networks to play games, for example, but I want to take it a step furthe…
-
**Description**
We want a model that has learned crisp and easily interpretable search algorithms. Such a model will solve mazes with high accuracy. However our ability to train such models is impa…
-
I used 8 gpus to train the model, but most memory is placed on the first GPU and i can not fully utilize other gpus, is threre any solution? thanks!
-
Hello,
I am working on using the o2grad package for a GAN, and I am running into the following error.
G.zero_grad()
device = "cpu"
bs = 32
z = torch.randn(bs, 30).to(device)
…
-
Hello,
Currently, calling `pack_padded_sequence` with some length of size zero raises the error:
```
ValueError: Length of all samples has to be greater than 0, but found an element in 'lengths' …
-
https://github.com/GiorgosXou/NeuralNetworks/blob/dee00f691357a4712ea981b6f48e84e290c4df90/examples/FeedForward_double_Xor/FeedForward_double_Xor.ino#L1
*calculates the number of *"elements in an …
-
1. I do not find the mixed objective optimization,this project no relationship with the paper—“MOON”.
2.Why use the loss function —“EuclideanLoss”?It is better use “Sigmoid Cross-Entropy” or “HingeLo…
-
Congratuations! You have done a great work. But I have a question about the center loss.
Can you explain why you wrote there two lines?
https://github.com/KaiyangZhou/pytorch-center-loss/blob/082ffa…