-
## Describe the bug
I would like to combine [hydra multi-runs](https://hydra.cc/docs/tutorials/basic/running_your_app/multi-run/) with ClearML remote execution. I.e. configuring a multi-run task with…
-
Hello,
Thank you for sharing your work! I'm getting the error below after training with the mezo.sh script:
RuntimeError: Default process group has not been initialized, please make sure to call…
-
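Hello, does the project have any particular requirements for the data, such as channels or devices? Some of my data was fused successfully, but another batch raised a fusion error and could not run. Thanks for your help.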
-
When `test_batch_size` is set through the command line, e.g.,
```
python run_mwptoolkit.py --model=GTS --dataset=mawps --task_type=multi_equation --gpu_id=0 --equation_fix=prefix --test_batch_size=32
`…
-
### 🐛 Describe the bug
Running the trainer fails with the following error:
```
Traceback (most recent call last):
File "/home/dhl/LongChat-dev/longchat/dist_attn/train.py", line 9, in
train()
File "/home/dhl…
-
code:
import torch
import transformers
from transformers import AutoTokenizer, AutoModelForCausalLM, TrainingArguments
import pyreft
from huggingface_hub import login
login(token="***")
model_n…
-
### System Info
Hi Team,
When I run the above QLoRA code for the OWL-ViT model (google/owlvit-base-patch32) with the 4-bit bnbconfig below, fine-tuning proceeds without any error.
b…
-
### System Info
```Shell
compute_environment: LOCAL_MACHINE
distributed_type: 'MULTI_GPU'
downcast_bf16: 'no'
gpu_ids: all
machine_rank: 0
main_training_function: main
mixed_precision: 'fp16…
-
### What happened + What you expected to happen
When a user needs to run the DDPG algorithm, the current DDPGTrainer supports only the single-machine version of the algorithm. If the user needs t…
-
- High: It blocks me to complete my task.
Hi, I'm new to OpenAI Gym and RLlib, so my question may be dumb.
I'm using
Anaconda Python 3.9
Gym 0.21.0
Ray 1.12.1
TensorFlow 2.8
…