bitextor / bifixer

Tool to fix bitexts and tag near-duplicates for removal
GNU General Public License v3.0
29 stars 3 forks source link

Bifixer doesn't see input file #2

Closed Syrkovski closed 5 years ago

Syrkovski commented 5 years ago

Hello, I tried use bifixer, but I have error. The command I used:

python3.7 bifixer/bifixer.py --scol 1 --tcol 2 --ignore_duplicates corpus corpus.bifixed en zh

The error:

bifixer.py: error: argument input: can't open 'corpus': [Errno 2] No such file or directory: 'corpus'

I have the corpus.en and corpus.zh, but bifixer doesn't see it.

mbanon commented 5 years ago

Hi @Syrkovski . It has to be in a single file (paste corpus.en corpus.zh > corpus)

Syrkovski commented 5 years ago

Thank you