Closed Gromy1211 closed 4 years ago
yes, this is correct. More specifically, the file lists all merge operations in the order in which they will be applied. To apply the merge operations to a new file, use apply_bpe.py
.
Thanks for the quick response!! I think I understand how the BPE algorithm works now ;)
Every line of the output file of learn.py is consisted of two subword units(for instance,
o f</w>
)I am a little bit confused by these two units, does that mean
o
andf</w>
can be merged intoof<w>
when applying BPE?Thanks for answering!