vblagoje closed 2 months ago
I've managed to get the CI to pass. Note the changes in dependencies: it couldn't be done without them, and we need to pin a few more dependencies, which is OK.
The nltk tests that were failing are related to the inability to load old models from pickle files, which I think is no longer allowed in nltk 3.9.x.
I'll convert this draft PR into a regular PR.
I would make this limitation a bit more evident.
- if we don't want to suppress the parameter tokenizer_model_folder, we can log a clear warning.
- let's also add an upgrade entry in the release note.
I opted for always setting tokenizer_model_folder to None and logging a warning with a resolution path. This way we don't have to touch the codebase much or risk unintended consequences. LMK if you have a better proposal @julian-risch @anakin87
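For reference, a minimal sketch of the "always None + warning" approach, not the actual diff in this PR. The class name `SentenceSplitterSketch`, the `language` parameter, and the warning wording (including the suggested resolution) are hypothetical; only `tokenizer_model_folder` comes from the discussion above.

```python
import logging
from typing import Optional

logger = logging.getLogger(__name__)


class SentenceSplitterSketch:
    """Hypothetical stand-in for a component that accepts tokenizer_model_folder."""

    def __init__(self, language: str = "en", tokenizer_model_folder: Optional[str] = None):
        self.language = language

        if tokenizer_model_folder is not None:
            # Tell the user the value is ignored and how to resolve it.
            # The message text is an assumption, not the PR's wording.
            logger.warning(
                "tokenizer_model_folder='%s' is ignored: custom tokenizer models "
                "stored as pickle files can no longer be loaded. The default "
                "tokenizer for language '%s' will be used instead. Remove the "
                "tokenizer_model_folder argument to silence this warning.",
                tokenizer_model_folder,
                language,
            )

        # Always discard the user-supplied folder so the rest of the code
        # follows the same path as if the parameter had never been set.
        self.tokenizer_model_folder = None
```

Keeping the parameter in the signature means existing callers don't break; they just get the warning and the default behaviour.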
OK for me.
I would prefer that @julian-risch also take a look.
Makes sense 🙏