Texera / texera

Collaborative Machine-Learning-Centric Data Analytics Using Workflows
https://texera.github.io
Apache License 2.0
163 stars 73 forks source link

Introduce Vectorizer to ML models #2670

Closed aglinxinyuan closed 4 months ago

aglinxinyuan commented 5 months ago

This PR introduces CountVectorizer and TfidfTransformer such that ML operators can take text feature as input and train the models.

Screenshot 2024-05-20 at 13 54 16