Closed eu9ene closed 10 months ago
@jelmervdl
Traceback (most recent call last):
File "/Users/jelmer/Workspace/statmt/empty-trainer/tests/test_typos.py", line 95, in test_regression_40
self.assertNotEqual(next(iter(modifier([line]))), line)
File "/Users/jelmer/Workspace/statmt/empty-trainer/src/opustrainer/modifiers/typos.py", line 200, in __call__
yield self.apply(line)
File "/Users/jelmer/Workspace/statmt/empty-trainer/src/opustrainer/modifiers/typos.py", line 230, in apply
getattr(data, modifier)()
File "/Users/jelmer/.virtualenvs/opustrainer/lib/python3.8/site-packages/typo/Errer.py", line 69, in extra_char
char_to_add = en_default.get_random_neighbor(trigger_char)
File "/Users/jelmer/.virtualenvs/opustrainer/lib/python3.8/site-packages/typo/keyboardlayouts/en_default.py", line 118, in get_random_neighbor
return random.choice(NEIGHBORINGNUMPADDIGITS[char])
KeyError: '٦'
This happens while training a backward model for lt-en. If I remove the
typos
modified, the problem goes away.Failed task: https://firefox-ci-tc.services.mozilla.com/tasks/HBOXDKykSoGwm6iHcKN5jw Training log: https://firefoxci.taskcluster-artifacts.net/HBOXDKykSoGwm6iHcKN5jw/0/public/logs/live_backing.log Training corpus: https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/C_CXBDwhScC8Gzy7J9iJhw/runs/0/artifacts/public%2Fbuild%2Fcorpus.en.zst https://firefox-ci-tc.services.mozilla.com/api/queue/v1/task/C_CXBDwhScC8Gzy7J9iJhw/runs/0/artifacts/public%2Fbuild%2Fcorpus.lt.zst
Opus trainer config:
Training config: