Open JohnGiorgi opened 5 years ago
Currently, to align BERT tokens to original tokens (before BERT tokenization) we use some code I grabbed from the official BERT repo.
SpaCy has introduced functions specifically for aligning two tests tokenized with different tokenizers. Switch to this!
Currently, to align BERT tokens to original tokens (before BERT tokenization) we use some code I grabbed from the official BERT repo.
SpaCy has introduced functions specifically for aligning two tests tokenized with different tokenizers. Switch to this!