modAL-python / modAL

A modular active learning framework for Python
https://modAL-python.github.io/
MIT License
2.23k stars 324 forks source link

Support for pyspark.ml.classification estimators #190

Open peeyushsinghal opened 3 months ago

peeyushsinghal commented 3 months ago

Currently the library support sklearn (scikit-learn) estimators. How can I make it work for pyspark.ml.classification estimators. The example https://www.databricks.com/blog/2020/01/16/better-machine-learning-through-active-learning.html again uses pandas and scikit-learn based estimators, making it difficult to work with PySpark and classifiers. Please provide an example with native pyspark implementation