jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Apache License 2.0
7.31k stars 426 forks source link

请问支持中文对话么? #95

Closed xman1991 closed 7 months ago

jzhang38 commented 7 months ago

我们的训练数据是slimpajama + starcoderdata.

里面有很少的中文, 但是模型的中文能力应该很差。