-
Hello,
Thanks so much for this rich AutoML project! I enjoyed the CARS paper and I'm excited to try it out.
The basic example given [here](https://www.noahlab.com.hk/opensource/vega/page/doc.htm…
-
Hi
I am encountering an error when using 2 GPUs for training YOLOv10n. Here is the error:
```
(yolov10) C:\Users\muh\yolov10>yolo detect train data=coco.yaml model=yolov10n.yaml epochs=500 batc…
-
I'm trying to train on two datasets at once. I'm using npy file list files for my datsets. Is training on multiple datasets at once supported? I put both in my config file, but I just keep getting thi…
-
Hello, sorry im quite new to writing issues.
I trained a joint token classification and sequence classification model. To save it i used this:
`trainer.save_model("multi_task/")`
However trying t…
-
Using a well-crafted FAUXPILOT, we can execute inference tasks based on the Codegen model. I read recently that I can work on Fine-tune using the Codegen model on the following website.
* https://new…
-
When I try to evaluate with your code, I met GPU Memory Issue.
Especially, running this code
`CUDA_VISIBLE_DEVICES=0,1,2,3 mpirun -n 4 python entry.py evaluate --conf_files configs/see…
-
I tried to run the example in lightseq/examples/training/fairseq (I just entered the two lines of commands in README). After a few seconds, there raised an error:
‘’‘
return logging_output
Unboun…
-
### What happened + What you expected to happen
While enabling the `frequency` parameter In the Keras Callback (`from ray.air.callbacks.keras import Callback`), the checkpoints folder does not includ…
-
I met the **"ZeroDivisionError: float division by zero"** when I want to train the model with multi-gpu. And if only 1 gpu, the problem disappear but the training is too slow...
And the detailed trac…
-
## 🐛 Bug
Got RuntimeError when training transformer from scratch under `translation_multi_simple_epoch` task with fully sharded data parallel (FSDP).
### To Reproduce
Steps to reproduce the b…
thpun updated
3 years ago