Closed pjuangph closed 1 year ago
Hi @pjuangph,
Thanks for the interest.
ExpressionBertTokenizer
). That's not problematic, because we dont finetune XLNet, we train it entirely from scratch on molecules (original XLNet was only trained on natural text). So, generally we use this ExpressionBertTokenizer
for modeling small molecules, proteins and chemical reactions. But we have one application on a NLP dataset. For that one dataset, we finetune the original XLNet model and hence wrote this XLNetRTTokenizer
which inherits from the XLNetTokenizer
.Hope this helps
Closing this, feel free to reopen if applicable