LeeSureman / Flat-Lattice-Transformer

code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
1k stars 178 forks source link

数据处理preprocess.py报错 #86

Open tomatowithpotato opened 3 years ago

tomatowithpotato commented 3 years ago

发生异常: UnicodeDecodeError 'gbk' codec can't decode byte 0x84 in position 964: illegal multibyte sequence File "D:\MyCode\python\Named Entity Recognition\Flat-Lattice-Transformer-master\preprocess.py", line 15, in lexicon_lines = lexicon_f.readlines()

yuanshandaren commented 2 years ago

发生异常: UnicodeDecodeError 'gbk' codec can't decode byte 0x84 in position 964: illegal multibyte sequence File "D:\MyCode\python\Named Entity Recognition\Flat-Lattice-Transformer-master\preprocess.py", line 15, in lexicon_lines = lexicon_f.readlines()

用windows产生的问题,换linux就不会产生了这个bug了