wenge-research / YAYI2

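YAYI 2 is a new generation of open-source large language models developed by Wenge (中科闻歌), pretrained on more than 2 trillion tokens of high-quality, multilingual data. (Repo for YaYi 2 Chinese LLMs)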
Apache License 2.0
3.61k stars 17 forks

Alignment data #2

Closed averkij closed 10 months ago

averkij commented 10 months ago

Hello. Thank you for your work.

Is the SFT dataset available?

It would be great to adapt it to different languages.

yhyu13 commented 10 months ago

They are supposed to be working on the chat model; the alignment dataset should be released by then.

wenge-research commented 10 months ago

We will release the chat model in the future, but there are no release plans for the SFT dataset yet.