dqwang122 / HeterSumGraph

Code for ACL2020 paper "Heterogeneous Graph Neural Networks for Extractive Document Summarization"
244 stars 52 forks source link

生成的VOCAL_FILE文件读取编码不对UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa1 in position 236: invalid start byte #43

Open CSgaoan opened 1 year ago

CSgaoan commented 1 year ago

屏幕截图 2023-07-06 222122 Traceback (most recent call last): File "E:\project\HeterSumGraph-master\train.py", line 438, in main() File "E:\project\HeterSumGraph-master\train.py", line 385, in main vocab = Vocab(VOCAL_FILE, args.vocab_size) File "E:\project\HeterSumGraph-master\module\vocabulary.py", line 53, in init for line in vocab_f: # 遍历文件的每一行 File "C:\Users\24672\anaconda3\envs\Pytorch\lib\codecs.py", line 322, in decode (result, consumed) = self._buffer_decode(data, self.errors, final) UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa1 in position 236: invalid start byte

CSgaoan commented 1 year ago

If you know how to solve this problem, I hope you will not hesitate to enlighten.

CSgaoan commented 1 year ago

Thank you very much!