Closed jettjaniak closed 2 months ago
Looks like you need to setup black, CI is failing on that
Also, could you think about some localized unit tests? Like we have a pre-defined string as a text to train on and we check if the resulting tokenizer has the same vocab and tokenizes text the way we expect.
Looks like you need to setup black, CI is failing on that