Thank for great repo @Alex-Fabbri. I follow readme.txt in Hi-Map
I run run_prep_newser, I have list .pt file after that i have newser_sents.vocab.pt.
I run run_inference_newser.sh, but i get err:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 64: invalid start byte
I found that error from when call func read data from newser_sent_500/newser_sents.vocab.pt file:
def make_text_iterator_from_file(path):
with codecs.open(path, "r", "utf-8") as corpus_file:
for line in corpus_file:
yield line
file: code/Hi_MAP/onmt/inputters/text_dataset.py
I using raw data from Raw data -- zipped
Some version from me:
torch 1.8.0
torchtext 0.9.0
cuda 11.1
Many thank for your help!!
Thank for great repo @Alex-Fabbri. I follow readme.txt in Hi-Map
run_prep_newser
, I have list .pt file after that i havenewser_sents.vocab.pt
.run_inference_newser.sh
, but i get err:I found that error from when call func read data from
newser_sent_500/newser_sents.vocab.pt
file:I using raw data from Raw data -- zipped Some version from me: torch 1.8.0 torchtext 0.9.0 cuda 11.1 Many thank for your help!!