Currently ctparse depends on numpy, scikit-learn and scipy - only for the relatively simple naive Bayes and vectorizer. It would reduce package size and issues integrating with other libs significantly when we would remove these deps - by simply having a private implementation.
Only caveat: the naive Bayes is called very often and at least it prediction runtime is crucial for the speed of ctparse. Any replacement should not be slower than what we currently have.
Currently
ctparse
depends onnumpy
,scikit-learn
andscipy
- only for the relatively simple naive Bayes and vectorizer. It would reduce package size and issues integrating with other libs significantly when we would remove these deps - by simply having a private implementation.Only caveat: the naive Bayes is called very often and at least it prediction runtime is crucial for the speed of
ctparse
. Any replacement should not be slower than what we currently have.