Open kernelmachine opened 4 years ago
Add wordpiece tokenization tools to repository, to reduce overall vocabulary size and improve training speed.
Add wordpiece tokenization tools to repository, to reduce overall vocabulary size and improve training speed.