Closed isVoid closed 4 years ago
hm, I cannot reproduce this (on Linux), even when changing my locale to something that isn't UTF-8.
As a quick note: your powershell uses cp1252 as default encoding, but you're trying to work with UTF-8 encoded files. You can probably work around the problem by changing the encoding in your powershell, or upgrading to powershell 6, which uses UTF-8 by default.
Myle Ott also commented elsewhere that fairseq expects UTF-8 throughout: https://github.com/pytorch/fairseq/issues/1287#issuecomment-566270467
I'm closing this, since this issue is ultimately caused by running fairseq in a non-supported environment.
I ran the fairseq-interactive from fairseq here. The environment is Windows 10 18363, Powershell 5.1.18362.628. Python version is 3.7.5.
The interactive executable failed to launch with
Further investigation showed that
arg_parse
inapply_bpe::create_parser
use "cp1252" encoding if not specified inargparse.FileType()
under my environment. Addingencoding='utf-8'
to the function in the line of--codes
temporarily fix the problem. But this is very hacky.