-
My GPU is 4060Ti with 16g VRAM and 32g RAM.I am encountering a CUDA Out of Memory error when training a network using the flux_train_network.py script, even though the system shows that there is suffi…
-
Following the code from https://trankit.readthedocs.io/en/latest/training.html#training-a-lemmatizer i get a KeyError: 'lemma':
```
Setting up training config...
Initialized lemmatizer trainer
Tra…
-
This is quite a good work. I used pip install dev.. to install, but there are quite a lot problems.
(vlmrm) root@autodl-container-d33848b29e-3752a142:~/vlmrm/vlmrm# vlmrm train "$(cat config.yaml…
-
## ❓ Questions and Help
Hi!
We are trying to train Gemma-2-9B on v4-64 and v5-128 Pod as mentioned in [this comment](https://github.com/pytorch/xla/issues/7987#issuecomment-2352326629). We use FS…
ayukh updated
3 weeks ago
-
There needs to be some fixes for this to work, it is not working as of now with Python 3.10.
Most of the installer problems are solved with theese:
```
!pip install wldhx.yadisk-direct
…
-
**Describe the bug**
Using torch 2.1.1, running bash examples/bert/train_bert_340m_distributed.sh produces JIT error due to the Sequence annotator in `calculate_logits_max`
```
return torch.jit.scr…
-
Hello
when running train.py it eventually runs into this issue:
```
Traceback (most recent call last):
File "C:\work\tekuchi\SuGaR-main\train.py", line 170, in
refined_sugar_path = ref…
-
Hi,
Thanks for sharing this great project.
I want to try run this project with my own dataset, but the training phased failed when calculating [object cross entropy loss](https://github.com/lkeab/ga…
-
https://github.com/DFHack/dfhack/blob/5b5a1ac363d83615584caa37535e943679a2b0b8/library/modules/Units.cpp#L509
Animal can also get trainer in vanilla UI when it is (fully) tamed and player tries to …
-
### Feature request
From my understanding of the current implementation, the modules_to_save wrappers are currently limited to copying only one specific layer of the model (reference: https://github.…