I was trying to use these scripts with a dataset I downloaded that was in UTF-8. Python 3 uses the system character encoding by default, which on an en-US Windows is CP1252, so it failed. This patch adds an --input_encoding argument to train.py so I could simply run train.py --input_encoding=utf8.
I also fixed the help text for --gpu_mem, which had bare percent signs which broke things when running train.py --help.
Thanks for publishing this repository, it's really helpful!
I was trying to use these scripts with a dataset I downloaded that was in UTF-8. Python 3 uses the system character encoding by default, which on an en-US Windows is CP1252, so it failed. This patch adds an
--input_encoding
argument to train.py so I could simply runtrain.py --input_encoding=utf8
.I also fixed the help text for
--gpu_mem
, which had bare percent signs which broke things when runningtrain.py --help
.Thanks for publishing this repository, it's really helpful!