IBM / regression-transformer

Regression Transformer (2023; Nature Machine Intelligence)
https://www.nature.com/articles/s42256-023-00639-z
MIT License
144 stars 21 forks source link

Bigsmiles Tokenizer #15

Closed jannisborn closed 1 year ago

jannisborn commented 1 year ago

Add support for bigsmiles tokenization based on a regexp. Favoring simple regexp compared to bigsmiles.tokenizer.text_tokenize in bigsmiles repo for speed reasons