NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.
Apache License 2.0
24
stars
2
forks
source link
Both `load_benchmark('NusaTranslation')` and `load_benchmark('NusaWrites')` tries to download a file that does not exist #19
Python 3.10, NusaCrowd 0.1.2
Error message:
FileNotFoundError: Couldn't find file at https://raw.githubusercontent.com/IndoNLP/nusa-writes/main/data/nusa_kalimat-mt-bug-train.csv