-
Traceback (most recent call last):
File "F:\xiazai\MedicalGPT-main\merge_peft_adapter.py", line 109, in
main()
File "F:\xiazai\MedicalGPT-main\merge_peft_adapter.py", line 64, in main
…
-
File "F:\xiazai\MedicalGPT-main\dpo_training.py", line 497, in
main()
File "F:\xiazai\MedicalGPT-main\dpo_training.py", line 472, in main
train_result = trainer.train()
File "C:\User…
-
Traceback (most recent call last):
File "F:\xiazai\MedicalGPT-main\ppo_training.py", line 516, in
main()
File "F:\xiazai\MedicalGPT-main\ppo_training.py", line 270, in main
model = Au…
-
基于yi-6B模型,进行全参数SFT后,infer结果为空。transformer版本为4.37.2
```
报错:
Some weights of LlamaForCausalLM were not initialized from the model checkpoint at *path*and are newly initialized:
You should probably…
nuoma updated
6 months ago
-
### System Info / 系統信息
都是正常的
### Who can help? / 谁可以帮助到您?
_No response_
### Information / 问题信息
- [ ] The official example scripts / 官方的示例脚本
- [X] My own modified scripts / 我自己修改的脚本和任务
### Reprod…
-
You are using an old version of the checkpointing format that is deprecated (We will also silently ignore `gradient_checkpointing_kwargs` in case you passed it).Please update to the new format on your…
-
Traceback (most recent call last):
File "C:\Users\admin\.conda\envs\newrlhf\lib\site-packages\transformers\utils\hub.py", line 398, in cached_file
resolved_file = hf_hub_download(
File "C:\…
-
大佬,在增量预训练时,使用的是企业的一些简介和经营范围进行尝试训练(所有数据都是领域数据的文本)差不多使用了10W条数据,但是在训练时,发现loss一直在缓慢的增加,请问您遇到过这种问题吗?
启动命令:
CUDA_VISIBLE_DEVICES=0 torchrun --nproc_per_node 1 pretraining.py --model_type chatglm …
-
Traceback (most recent call last):
File "/content/MedicalGPT/supervised_finetuning.py", line 1394, in
main()
File "/content/MedicalGPT/supervised_finetuning.py", line 1315, in main
mo…
-
### Describe the Question
Please provide a clear and concise description of what the question is.
我看了一下baichuan2的chat template。https://github.com/shibing624/MedicalGPT/blob/474b32c352423b4051dbf07…