Closed thomaschhh closed 10 months ago
Yes, you are right, thanks!
Btw, check the newer version of the dataset on my huggingface page, it contains some intermediate files too.
Feel free to ask questions
I think I found another typo: https://github.com/bene-ges/nemo_compatible/blob/194af660d9b6d3d578884048d40b524775fd10e8/scripts/nlp/en_spellmapper/dataset_preparation/prepare_corpora_after_alignment.py#L246
Based on Line 23 it should be extract_giza_alignments
and not extract_alignments
.
Thanks, fixed in .sh
file
I think NEMO_PATH
should be NEMO_COMPATIBLE_PATH
Sure, fixed
I am trying to walk through the steps mentioned at https://github.com/bene-ges/nemo_compatible/blob/main/scripts/nlp/en_spellmapper/README.md and am running into:
''FileNotFoundError: [Errno 2] No such file or directory: 'yago.uniq3'"
Might it be the case that it's supposed be "yago.uniq2" instead?
https://github.com/bene-ges/nemo_compatible/blob/194af660d9b6d3d578884048d40b524775fd10e8/scripts/nlp/en_spellmapper/dataset_preparation/run_g2p.sh#L21