-
After installing successfully, I got this error when I ran this command: `python e2e_rlhf.py --actor-model facebook/opt-1.3b --reward-model facebook/opt-350m --deployment-type single_gpu`
![Capture](…
-
https://github.com/microsoft/DeepSpeedExamples/blob/957ae3141946daf9a6bc5731e261032a13a82f05/applications/DeepSpeed-Chat/training/step1_supervised_finetuning/main.py#L367
After training for just one epoch, I got…
-
AutoModelForCausalLM has no class for chatglm. How did you work around this?
-
Hello, I'd like to know how supervised fine-tuning with the CONTRIQUE encoder performs compared to an ImageNet-pretrained model, but I can't find such an experiment in the paper.
Can you share the results if you have done…
-
Hello Teacher Xu, I'm using a Windows machine with four 3090 GPUs, each with 24 GB of VRAM. Because it is a Windows platform, I wrote a bat script to run it:
`@echo off
set CUDA_VISIBLE_DEVICES=0,1,2,3
call python supervised_finetuning.py ^
--model_type baichuan ^
--model_name_or_p…
-
Thanks a lot for releasing the code and the scripts for pre-training.
I'm trying to reproduce the numbers on MS-Marco after fine-tuning and it would be great if you could also release the scripts f…
-
## description
- datasets: 1.1 GB Chinese corpus, about 2 million lines
- devices
- CPU=8 GPU=1 Memory=320G Node=1 Type=A100-SXM-80GB
## question
The training process was always killed …
-
We trained our model using [AnghaBench](https://github.com/brenocfg/AnghaBench) compilation results across four optimization levels (O0~O3), selecting samples under 1024 tokens. That gave us a total o…
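The length filter mentioned above (keeping only samples under 1024 tokens) could look roughly like the sketch below. This is only an illustration: the tokenizer actually used is not stated, so `whitespace_tokenize` is a hypothetical stand-in, and `select_short_samples` is an illustrative name rather than anything from the project.

```python
from typing import Callable, Iterable, List

MAX_TOKENS = 1024  # threshold stated in the text


def whitespace_tokenize(text: str) -> List[str]:
    # Hypothetical stand-in: the real tokenizer is not given in the source.
    return text.split()


def select_short_samples(
    samples: Iterable[str],
    tokenize: Callable[[str], List[str]] = whitespace_tokenize,
    max_tokens: int = MAX_TOKENS,
) -> List[str]:
    # Keep only samples whose token count is strictly below max_tokens.
    return [s for s in samples if len(tokenize(s)) < max_tokens]
```

Passing the model's own tokenizer as `tokenize` would make the counts match what the model sees during training; the whitespace split is only a placeholder.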
-
I have a question: does the training dataset used to pre-train UniTS contain the test dataset? If so, then the training process for time series prediction and time series completion is essentially a self-…
-
Nice work! I have a question:
I am trying to pre-train and fine-tune a model on my own datasets. However, some warnings were raised when loading the pre-trained model during fine-tuning:
…