-
I tried to reproduce your gemma2B reward model training again and found that the reward model architecture fine tuned with internlm2 had an output header of 1. I downloaded your GRM-Gemma-2B-Sftrug re…
-
can we please get official support for internLM-2.5?
I have seen a closed issue regarding that #734. however, the model mentioned there might be broken as it fails to load for instance.
It woul…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
### Describe the bug
Segmentation fault whe…
-
背景:
模型internlm2.5-7b-chat
单机4卡A10,单卡24GB
问题:
1. xtuner能实现模型切分么?即多个卡共用一个模型,而不是在每个卡上都单独加上一个模型再去微调训练,这样的话很容易会显存不足;能做到说模型切分到了某张卡运行,其余的卡加载数据训练吗?
2. 按官方要求的多轮对话格式训练时,其中的单个message json就达到了几百个,且暂时不拆分数…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
``` shell
D:\AI_model>lmdepl…
-
### What is the issue?
In previous versions, I set the context length of each of my models to the maximum value that could be fully loaded onto the GPU memory. However, after the update, I found that…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [ ] 2. The bug has not been fixed in the latest version.
### Describe the bug
Model: internlm2-chat-7b
GPU…
-
这个模型效果非常不错,数学上面接近gpt4o了
-
### Describe the bug
我好像没有找到用internevo训练然后转换成对应的hf的脚本?请问有提供嘛?
### Environment
官方代码
### Other information
_No response_
-
如题,是开放出来了?对应是datasets文件夹下的哪个数据?
同理:
老母亲心理咨询师 模型 对应的训练数据是哪几个?
艾薇御姐 模型 对应的训练数据是哪几个?
感谢。