llm-jp / scripts

Apache License 2.0
1 stars 1 forks source link

Dump raw training data for the LLM-jp-3 series #46

Open hkiyomaru opened 1 month ago

hkiyomaru commented 1 month ago

Dump raw training data for the LLM-jp-3 series. For each training instance, the following fields should be included at least:

hkiyomaru commented 1 month ago

https://github.com/llm-jp/Megatron-LM/tree/nii-geniac-dump