KempnerInstitute / kempner-hpc-handbook

Kempner Institute HPC User Guide
https://kempnerinstitute.github.io/kempner-hpc-handbook/intro.html
Creative Commons Zero v1.0 Universal
6 stars 3 forks source link

Add eleuther-ai-gpt-neox-20b-pii-special datasets #74

Open mmshad opened 4 months ago

mmshad commented 4 months ago

Shared Data Repository page

books      c4      cc_en_head      cc_en_middle      cc_en_tail      peS2o  stack-code  wiki-en-simple
books_val  c4_val  cc_en_head_val  cc_en_middle_val  cc_en_tail_val  peS2o_val  stack-code_val  wiki-en-simple_val

Path: /n/holylfs06/LABS/kempner_shared/Lab/data/dolma/preprocessed/eleuther-ai-gpt-neox-20b-pii-special

Naeemkh commented 1 month ago

@mmshad, who is going to take this issue?