-
### Component
Forge
### Have you ensured that all of these are up to date?
- [X] Foundry
- [X] Foundryup
### What version of Foundry are you on?
_No response_
### What command(s) is the bug in?
…
-
File: [/master/optimizer-hints.md](https://docs.pingcap.com/zh/tidb/dev/optimizer-hints)
Please add these items:
```
INDEX_JOIN
INDEX_HASH_JOIN
INDEX_MERGE_JOIN
```
ref asktug topic: http…
-
In the README for the distributed optimizer, it is mentioned that when using bf16 training, a combination of bf16 model parameters and fp32 model grads is employed, and the distributed optimizer's fp3…
-
The docstring for the Optimizer class notes that when using the `.step` function with a closure, this closure should not change the parameter gradients:
https://github.com/pytorch/pytorch/blob/4e93…
khdlr updated
7 months ago
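For context on the closure question, here is a minimal sketch of the `step(closure)` contract in plain Python. `ToySGD` and its list-based `params`/`grads` are hypothetical stand-ins, not PyTorch's implementation; the assumption is only that the closure re-evaluates the loss (recomputing gradients along the way) and returns it, which is how closures are commonly written for `.step`.

```python
class ToySGD:
    """Hypothetical stand-in for an optimizer exposing step(closure)."""

    def __init__(self, params, grads, lr=0.1):
        self.params = params  # list of floats, updated in place
        self.grads = grads    # list of floats, filled in by the closure
        self.lr = lr

    def step(self, closure=None):
        loss = None
        if closure is not None:
            # The closure re-evaluates the loss and refreshes the gradients.
            loss = closure()
        for i, g in enumerate(self.grads):
            self.params[i] -= self.lr * g
        return loss


params = [3.0]
grads = [0.0]
opt = ToySGD(params, grads, lr=0.1)


def closure():
    # Loss f(x) = x^2 with gradient 2x, recomputed on every call.
    x = params[0]
    grads[0] = 2.0 * x
    return x * x


# Each step returns the loss evaluated before the parameter update.
losses = [opt.step(closure) for _ in range(5)]
```

In this sketch the closure does write to the gradients, which is the behaviour the docstring wording quoted above appears to rule out; that tension is what the issue is asking about.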
-
Hello, it looks like EmbeddingBagCollection forces the data type to be float32 or float16 during initialization.
https://github.com/pytorch/torchrec/blob/main/torchrec/modules/embedding_modules.py#L179
…
-
The implementation of DOT seems to be based on SGD with momentum. Since vision transformers usually use AdamW as the optimizer, how about adapting DOT to other optimizers such as AdamW or LAMB?
-
Dear all,
NextFace complained that it cannot find any face in the image. I tried saving the image as JPG or PNG, but it didn't work. I even tried to
use your image from GitHub, but without any luck. Sor…
-
See the last few code boxes of https://colab.research.google.com/drive/1U_qMlcQfD1Dxe-_V9cHpNOA2iF8X1jYg#scrollTo=emIvHzxwQtzj&line=21&uniqifier=1
**Adam#Lion optimizer coding**
```python
impor…
```
-
When I run the training code, I encounter ModuleNotFoundError: No module named 'utils'
![image](https://github.com/user-attachments/assets/be0ac048-32bd-4b40-9606-bbce9a1e3048)
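The issue does not show the repository layout, so the cause here is unknown. One common cause of this error, assumed for illustration only, is that the directory containing the local module is not on `sys.path` when the script is launched. The sketch below reproduces that situation with a temporary directory; `utils_demo_local` is a hypothetical module name standing in for the repository's `utils`.

```python
import importlib
import sys
import tempfile
from pathlib import Path

MODULE_NAME = "utils_demo_local"  # hypothetical stand-in for the repo's `utils`

with tempfile.TemporaryDirectory() as repo_root:
    # Hypothetical layout: a local module file sitting in the repository root.
    Path(repo_root, f"{MODULE_NAME}.py").write_text(
        "def helper():\n    return 'ok'\n"
    )

    # Without the repository root on sys.path, the import fails.
    try:
        importlib.import_module(MODULE_NAME)
        failed_before = False
    except ModuleNotFoundError:
        failed_before = True

    # Adding the repository root to sys.path makes the module importable.
    sys.path.insert(0, repo_root)
    importlib.invalidate_caches()
    try:
        helper_result = importlib.import_module(MODULE_NAME).helper()
    finally:
        sys.path.remove(repo_root)
```

If this is the cause, running the training script from the repository root, or setting `PYTHONPATH` to that directory, usually has the same effect as the `sys.path.insert` call.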
-
GitHub: https://github.com/Liuhong99/Sophia
According to the paper on arXiv (https://arxiv.org/pdf/2305.14342.pdf), it achieves a 2x speed-up compared to AdamW.