How to save tokenizer automatically?

amazon-science / tanl

Structured Prediction as Translation between Augmented Natural Languages

Apache License 2.0

130 stars 25 forks source link

How to save tokenizer automatically? #13

Closed arlethitgo closed 1 year ago

arlethitgo commented 1 year ago

Hello! I would like to ask how do you automatically save the tokenizer of the new finetuned models?

giove91 commented 1 year ago

Hi, thanks for your interest in our work! We do not change the tokenizer, so it is exactly the same as the pre-trained tokenizer. The delimiters of the "augmented natural languages" are not treated in any special way during tokenization.

I hope this clarifies!