Closed gk966988 closed 1 year ago
I want to build a new tokenizer. Which language model did you use to obtain the tokenizer? Can I train a BERT tokenizer from the transformers library and use it in Nougat?
You can use any tokenizer in the Hugging Face (HF) format. See the trainer docs: https://huggingface.co/docs/tokenizers/api/trainers
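For anyone landing here, a minimal sketch of training a tokenizer in the HF format with the `tokenizers` library follows. It is not necessarily how the Nougat tokenizer was produced. The toy corpus, the vocabulary size, the output file name `tokenizer.json`, and the special tokens are all assumptions for illustration, so check them against the tokenizer config of the checkpoint you plan to use.

```python
from tokenizers import Tokenizer
from tokenizers.models import BPE
from tokenizers.pre_tokenizers import Whitespace
from tokenizers.trainers import BpeTrainer

# Hypothetical toy corpus; replace with an iterator over your own text.
corpus = [
    "Nougat converts scientific PDFs to markup.",
    "A tokenizer splits text into subword units.",
    "\\frac{a}{b} and \\sum_{i=1}^{n} x_i are LaTeX snippets.",
]

# Assumed special tokens; verify against the model config you target.
special_tokens = ["<s>", "<pad>", "</s>", "<unk>"]

# BPE model with a simple whitespace/punctuation pre-tokenizer.
tokenizer = Tokenizer(BPE(unk_token="<unk>"))
tokenizer.pre_tokenizer = Whitespace()

# vocab_size is an upper bound; a tiny corpus yields a smaller vocabulary.
trainer = BpeTrainer(vocab_size=500, special_tokens=special_tokens)
tokenizer.train_from_iterator(corpus, trainer=trainer)

# Save in the single-file HF format and load it back to check the round trip.
tokenizer.save("tokenizer.json")
reloaded = Tokenizer.from_file("tokenizer.json")

encoding = reloaded.encode("A tokenizer splits text.")
print(encoding.tokens)
print(encoding.ids)
```

The saved file can usually be wrapped for transformers with `PreTrainedTokenizerFast(tokenizer_file="tokenizer.json")`. A new vocabulary will not match the embeddings of a pretrained checkpoint, so the model would likely need retraining or at least resized embeddings.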