Open eu9ene opened 7 months ago
I'm getting the same error trying to use bicleaner-harddrules with languages en-da (English-Danish), en-nl (English-Dutch) as well as en-pt (English-Portuguese). The environment is Python 3.10 and using Windows Linux Subsystem on Windows. I tried it in Google Colab and it does seem to work there. Not yet sure what the difference is yet. Both are on Ubuntu Jammy.
Error:
[0/0:bicleaner_hardrules] Traceback (most recent call last):
[0/0:bicleaner_hardrules] File "/home/user/.local/lib/python3.10/site-packages/opuscleaner/filters/../threshold.py", line 142, in wrapper
[0/0:bicleaner_hardrules] return fn(*args, **kwargs)
[0/0:bicleaner_hardrules] File "/home/user/.local/lib/python3.10/site-packages/opuscleaner/filters/../threshold.py", line 192, in threshold_scores
[0/0:bicleaner_hardrules] item[1].score = float(fchild.readline())
[0/0:bicleaner_hardrules] ValueError: could not convert string to float: b''
[0/0:bicleaner_hardrules] Killed
Filtering fails on some datasets, for example, en-ru OPUS XLEnt
Cleaning config: