可以提供更详细一些的信息吗？

wenge-research / YAYI

雅意大模型：为客户打造安全可靠的专属大模型，基于大规模中英文多领域指令数据训练的 LlaMA 2 & BLOOM 系列模型，由中科闻歌算法团队研发。(Repo for YaYi Chinese LLMs based on LlaMA2 & BLOOM)

https://www.wenge.com/yayi/index.html

Apache License 2.0

3.26k stars 44 forks source link

Closed nuoma closed 12 months ago

nuoma commented 1 year ago

你好，关于你们的模型yayi-13b-llama2，希望能够提供更多的信息。比如我现在能看到词表32005是没有动过的。你们的增量中文预训练数据大约是什么样的量级？有没有拿他跑过类似ceval和cmmlu这类的分数？谢谢！

huxiaosheng123 commented 1 year ago

wenge-research commented 12 months ago

开源版本我们扩展了用于Chat的special token；Ceval 当时9月份打过榜，如图：

mmlu等其他榜单我们提交了huggingface open_llm_leaderboard。