ondrejklejch / MT-ComparEval

Tool for comparison and evaluation of machine translation.
Apache License 2.0
56 stars 14 forks source link

Task import fails with "bad number of sentences" but number of sentences is ok #79

Open lefterav opened 6 years ago

lefterav commented 6 years ago

I am having here an issue where source, reference and translation contain 907 sentences (this is also reported at the log output for all three imports). Experiment (source and reference) is imported with no issue, but importing the task gives me this error. #

[24-Jan-2018 14:37:31]  source.txt used as a source source.
[24-Jan-2018 14:37:31]  mt-l-gedit has 907 source sentences
[24-Jan-2018 14:37:31]  reference.txt used as a reference source.
[24-Jan-2018 14:37:31]  mt-l-gedit has 907 reference sentences
[24-Jan-2018 14:37:31]  Experiment mt-l-gedit uploaded successfully.
[24-Jan-2018 14:38:10]  Importing task: mt-l-gedit:mtmt
[24-Jan-2018 14:38:10]  translation.txt used as a translation source.
[24-Jan-2018 14:38:10]  mtmt has 907 translation sentences
[24-Jan-2018 14:38:10]  mtmt has bad number of sentences
[24-Jan-2018 14:38:10]  Parsing of mtmt aborted!

I have checked and there are no empty sentences, as in previous issue. I suspect the error is due to bad file encoding, but I cannot confirm that yet