-
替换为自己的数据集微调百川Baichuan-13B之后,出现了模型基础对话能力丧失的情况,不知道算不算是灾难性遗忘,有发现过这个问题的友友们可以指导一下解决我的问题吗
-
如题
![微信图片_20240110173848](https://github.com/baichuan-inc/Baichuan2/assets/39690525/13299155-a5b3-4d38-8c47-231d3fbcc045)
![image](https://github.com/baichuan-inc/Baichuan2/assets/39690525/02c12…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
model.enable_input_require_grads() 这块报错。如何解决呀
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
I am trying to convert baichuan2-megatron to hf. When reading the code, i can not understand this part
```
def permute(x):
if revert:
return x.view(head_dim//2, 2, dim).transpo…
drxmy updated
10 months ago
-
I've followed the instruction
https://github.com/triton-inference-server/tensorrtllm_backend/blob/main/docs/baichuan.md
to run Baichuan2-7b-Chat.
But for exactly the same engine, the outputs are …
-
![image](https://github.com/baichuan-inc/Baichuan2/assets/42534237/10459dc4-4d09-4ad5-ad4e-5ccfa9a8501b)
这里对HumanEval和GSM8K评测都比较低,请问能否提供一些公开数据集的方法吗?谢谢
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/…