deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MIT License

About datasets #13

Closed ftgreat closed 4 months ago

ftgreat commented 4 months ago

Hi, thank you for your great work!

Could you provide more details about the pretraining dataset? How has the pretraining dataset been optimized in DeepSeek-V2 compared to the previous version, DeepSeek?

Thank you.