My corpous consists of pure numbers like 1, 2, ..., 1000000, ..., 1002342, ....
It is differen from words in any language.
Can I replace the vocab.txt with my own vocab.tx created using my corpous for fine-tuning albert?
Or, should I train albert on my corpous from scratch?
My corpous consists of pure numbers like 1, 2, ..., 1000000, ..., 1002342, .... It is differen from words in any language. Can I replace the vocab.txt with my own vocab.tx created using my corpous for fine-tuning albert? Or, should I train albert on my corpous from scratch?
Thanks.