When no tokens are retrieved for a text (e.g. preprocessing removed all words), the tokenizer raises a ValueError claiming the spaCy lan_model is not supported.
The check for the spaCy lan_model should be done beforehand.
If the text is empty, the token list can be empty and should be returned as an empty list.
+ Bump dependencies to the latest available versions using poetry lock
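
A minimal sketch of the intended behaviour, for reviewers. The names `tokenize` and `SUPPORTED_LAN_MODELS` and the model names are hypothetical and not taken from this repository, and whitespace splitting stands in for the real spaCy pipeline. The point is only the ordering: validate lan_model first, then return whatever tokens were found, even if there are none.

```python
from typing import List

# Hypothetical set of supported models (an assumption, not the project's list).
SUPPORTED_LAN_MODELS = {"en_core_web_sm", "it_core_news_sm"}


def tokenize(text: str, lan_model: str = "en_core_web_sm") -> List[str]:
    """Hypothetical tokenizer showing the proposed order of checks."""
    # Validate lan_model up front, so the error only fires for a model
    # that is actually unsupported, never because the text had no tokens.
    if lan_model not in SUPPORTED_LAN_MODELS:
        raise ValueError(f"spacy lan_model not supported: {lan_model!r}")

    # Stand-in for the spaCy pipeline: plain whitespace splitting.
    tokens = text.split()

    # Empty or fully stripped text simply yields an empty list.
    return tokens
```

With this ordering, an empty text and an unsupported model are two separate cases: the first returns an empty list, the second raises ValueError.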
Description
When no tokens are retrieved for a text (e.g. preprocessing removed all words), the tokenizer raises a ValueError claiming the spaCy lan_model is not supported.
+ Bump dependencies to the latest available versions using poetry lock
Related Issue
Type of Change
Checklist
CODE_OF_CONDUCT.md document.
CONTRIBUTING.md guide.
make format-code.