Open htw2012 opened 5 years ago
In fact you can reuse the released model if you add words by replacing the unused tokens (but you will have to train them).
@artemisart after i add new words by replacing the unused tokens, how should i train these new words ? thank you!
@zhangfazhan https://github.com/google-research/bert/issues/155 here is some advice
Hi, I have some questions about pre-training as follows:
vocab.txt
by characters. There are some low-frequency words, should low-frequency words be deleted from the dictionary?vocab.txt
that BERT-Base released ?Thank you in advance.