biolab / orange3

🍊 :bar_chart: :bulb: Orange: Interactive data analysis
https://orangedatamining.com
Other
4.85k stars 1.01k forks source link

Preprocessing: difference in preprocessing pipelines #5444

Closed ajdapretnar closed 3 years ago

ajdapretnar commented 3 years ago

But actually the first workflow overrides the default preprocessing of the learner, which causes errors sometimes.

Try heart-disease with Logistic Regression. It fails and the error is unclear for a beginner (ValueError: Wrapped scikit-learn methods do not support multinomial variables.). Adding Continuize solves the problem, but this is not clear from the error.

Proposed solutions:

janezd commented 3 years ago

Write this in the docs.

... in Preprocess and in Test and Score.