ymcui / Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models
Apache License 2.0
7k stars 570 forks

Which version should a vertical-domain LLM be trained from? #555

Closed Zheng-Jay closed 2 months ago

Zheng-Jay commented 3 months ago

The following items must be checked before submitting

Issue type

None

Base model

None

Operating system

None

Detailed description of the problem

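We have collected vertical-domain pre-training data and instruction data (mixed with general-purpose data). Should we build on Chinese-LLaMA-2 or Chinese-Alpaca-2? It seems most people run continued pre-training (PT) and SFT on top of the base model, but I would rather not waste the instruction tuning already in the instruct version. Which version gives better results as a starting point?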

Dependencies (must be provided for code-related issues)

# Paste your dependency information here (inside this code block)

Run logs or screenshots

# Paste your run logs here (inside this code block)
yourcaptain commented 3 months ago

I have the same question and would appreciate an answer.

github-actions[bot] commented 2 months ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.

github-actions[bot] commented 2 months ago

Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.