SkyworkAI / Skywork

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model parameters, training data, evaluation data, and evaluation methods.
1.21k stars, 111 forks

The dataset (Skypile-150B) cannot be downloaded #36

Closed nicosouth closed 10 months ago

nicosouth commented 10 months ago

Hello!

I want to download the Skypile-150B dataset.

However, the Hugging Face link returns a 404 Not Found error.

Is there any other download link?

Thank you!

zhao1iang commented 10 months ago

The dataset is currently being updated and will be back soon.

nicosouth commented 10 months ago

Thank you!