Closed LearnedVector closed 4 years ago
apply_bpe.py will continue to work with old-style BPE files. To create an old-style BPE-file, you can either check out an old version, or undo the relevant changes yourself. Specifically, change
to
vocab = dict([(tuple(x)+('</w>',) ,y) for (x,y) in vocab.items()])
and remove this line:
Hello, I have a use case that needs the old style bpe algorithm that has the as a separate space. How do I use that version?