rth closed this issue 4 years ago
Also, different ways of defining default and optional parameters (for vectorizers and other estimators in general) are discussed in https://github.com/rust-ml/discussion/issues/2
Resolved in https://github.com/rth/vtext/pull/57
Currently `CountVectorizer` and `HashingVectorizer` mostly perform BOW (bag-of-words) token counting, without the possibility to change the tokenizer or any other parameters. While we intentionally won't support all the parameters that the scikit-learn versions do (as these meta-estimators are doing too much), additional parametrization would be preferable.
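To make the request concrete, here is a rough sketch of what a parametrized vectorizer could look like, using a builder with a pluggable tokenizer. All names below (`Tokenizer`, `WhitespaceTokenizer`, `AlphanumericTokenizer`, `CountVectorizerParams`, the `lowercase` option, `count`) are hypothetical and chosen for illustration only; this is not vtext's actual API, and the linked pull request may have taken a different approach.

```rust
use std::collections::HashMap;

/// Hypothetical tokenizer interface (an assumption, not vtext's actual trait).
pub trait Tokenizer {
    fn tokenize<'a>(&self, text: &'a str) -> Vec<&'a str>;
}

/// Splits on Unicode whitespace; punctuation stays attached to tokens.
pub struct WhitespaceTokenizer;

impl Tokenizer for WhitespaceTokenizer {
    fn tokenize<'a>(&self, text: &'a str) -> Vec<&'a str> {
        text.split_whitespace().collect()
    }
}

/// Splits on every non-alphanumeric character and drops empty tokens.
pub struct AlphanumericTokenizer;

impl Tokenizer for AlphanumericTokenizer {
    fn tokenize<'a>(&self, text: &'a str) -> Vec<&'a str> {
        text.split(|c: char| !c.is_alphanumeric())
            .filter(|t| !t.is_empty())
            .collect()
    }
}

/// Hypothetical parameter builder: defaults live in `new()`,
/// and each setter overrides a single option.
pub struct CountVectorizerParams<T: Tokenizer> {
    tokenizer: T,
    lowercase: bool,
}

impl CountVectorizerParams<WhitespaceTokenizer> {
    /// Default parameters: whitespace tokenizer, lowercasing enabled.
    pub fn new() -> Self {
        CountVectorizerParams {
            tokenizer: WhitespaceTokenizer,
            lowercase: true,
        }
    }
}

impl<T: Tokenizer> CountVectorizerParams<T> {
    /// Swap in a different tokenizer, keeping the other options.
    pub fn tokenizer<U: Tokenizer>(self, tokenizer: U) -> CountVectorizerParams<U> {
        CountVectorizerParams {
            tokenizer,
            lowercase: self.lowercase,
        }
    }

    /// Enable or disable lowercasing before tokenization.
    pub fn lowercase(mut self, value: bool) -> Self {
        self.lowercase = value;
        self
    }

    /// Finalize the parameters into a vectorizer.
    pub fn build(self) -> CountVectorizer<T> {
        CountVectorizer { params: self }
    }
}

/// Minimal bag-of-words counter driven by the parameters above.
pub struct CountVectorizer<T: Tokenizer> {
    params: CountVectorizerParams<T>,
}

impl<T: Tokenizer> CountVectorizer<T> {
    /// Count token occurrences in a single document.
    pub fn count(&self, doc: &str) -> HashMap<String, usize> {
        let text = if self.params.lowercase {
            doc.to_lowercase()
        } else {
            doc.to_string()
        };
        let mut counts = HashMap::new();
        for token in self.params.tokenizer.tokenize(&text) {
            *counts.entry(token.to_string()).or_insert(0) += 1;
        }
        counts
    }
}
```

With this sketch, `CountVectorizerParams::new().build()` gives the default behaviour, while `CountVectorizerParams::new().tokenizer(AlphanumericTokenizer).lowercase(false).build()` overrides only the options the caller cares about. Making the vectorizer generic over the tokenizer keeps dispatch static; a boxed trait object would be an alternative if the tokenizer has to be chosen at run time.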