GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
http://sadedegel.ai
MIT License
Pre-processing options work for PreTrainedVectorizers [resolves #307] #308 (Open)
ertugrul-dmr opened 2 years ago
ertugrul-dmr commented 2 years ago
Added support for pre-processing in transformer-based vectorizers, which previously worked only on raw strings.
Added ability to use pre-trained vectorizers and Text2Doc together in pipelines.
Text2Doc should now pick up its settings correctly after they are altered.
Added test cases for the changes mentioned above.
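
To illustrate the pattern described in the changes above, here is a minimal, self-contained sketch. It does not use sadedegel's actual classes: `Doc`, `ToyText2Doc`, `ToyVectorizer` and `run_pipeline` are hypothetical stand-ins, and the two-number "embedding" is a placeholder, not a real pre-trained model. It only shows a pre-processing step feeding a vectorizer that accepts either pre-processed documents or raw strings, and a settings change taking effect on the next call.

```python
from dataclasses import dataclass
from typing import List, Sequence, Union


@dataclass
class Doc:
    """Hypothetical stand-in for a pre-processed document."""
    raw: str
    tokens: List[str]


class ToyText2Doc:
    """Hypothetical Text2Doc-style step: raw strings -> Doc objects."""

    def __init__(self, lowercase: bool = True, drop_punct: bool = True):
        self.lowercase = lowercase
        self.drop_punct = drop_punct

    def transform(self, texts: Sequence[str]) -> List[Doc]:
        docs = []
        for text in texts:
            tokens = text.split()
            # Settings are read on every call, so changing an attribute
            # after construction affects the next transform().
            if self.drop_punct:
                tokens = [t.strip(".,!?;:") for t in tokens]
                tokens = [t for t in tokens if t]
            if self.lowercase:
                tokens = [t.lower() for t in tokens]
            docs.append(Doc(raw=text, tokens=tokens))
        return docs


class ToyVectorizer:
    """Hypothetical vectorizer that accepts Doc objects or raw strings."""

    def transform(self, items: Sequence[Union[Doc, str]]) -> List[List[float]]:
        vectors = []
        for item in items:
            # Use pre-processed tokens when available; otherwise fall back
            # to whitespace splitting of the raw string.
            tokens = item.tokens if isinstance(item, Doc) else item.split()
            n = len(tokens)
            mean_len = sum(len(t) for t in tokens) / n if n else 0.0
            # Placeholder features: token count and mean token length.
            vectors.append([float(n), mean_len])
        return vectors


def run_pipeline(steps, texts):
    """Apply each step's transform() in order."""
    data = texts
    for step in steps:
        data = step.transform(data)
    return data


if __name__ == "__main__":
    text2doc = ToyText2Doc()
    print(run_pipeline([text2doc, ToyVectorizer()], ["Merhaba, Ankara!"]))
    text2doc.lowercase = False  # altered setting is honoured on the next call
    print(text2doc.transform(["Merhaba, Ankara!"])[0].tokens)
```

In the sketch the pre-processing settings live on the first step only, so the vectorizer never needs to know how the text was cleaned; whether the real implementation is organised this way is an assumption.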