-
I found that in the benchmark/suite has the output time to first token. However, when I run `python benchmark.py --model meta-llama/Llama-2-7b-hf static --isl 128 --osl 128 --batch 1` an error occurs:…
-
[baichuan-inc/baichuan-7B](https://huggingface.co/baichuan-inc/baichuan-7B)
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
baichuan-7b和baichuan2-7b的模型结构的区别在哪里,只有normhead么,可以直接将baichuan2-7b的参数加载到baichuan-7b上对么
-
同样的数据集同样的训练参数,在4张A100使用lora tuning baichuan-7b和baichuan-13b, baichuan-13b的显存占用比baichuan-7b小很多,请问这是正常现象吗
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
### 问题描述
是否可以增加 baichuan-7B 这个模型的微调和预训练
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
May I ask would Baichuan model compatible with Clover?
Could you please give some intruction for how to train the draft model for baichuan model?