elastic / kibana

Your window into the Elastic Stack
https://www.elastic.co/products/kibana

[ML] Data Frame Analytics: 'Is not included' filter not working as expected #188739

Open arisonl opened 2 months ago

arisonl commented 2 months ago

In Data Frame Analytics job setup, after unchecking a field X (e.g. Cancelled in the screenshot below) and then filtering for "is not included", the expected behavior is that field X appears in the filtered list.

Screenshot 2024-07-17 at 1 00 01 PM (1)

However, that is not currently the case: Cancelled does not appear in the filtered list (see screenshot below).

Screenshot 2024-07-17 at 1 02 40 PM

This is confusing. It is important to be clear about which variables are excluded before creating the model, because including variables that are tightly correlated with the dependent variable is "cheating", and including variables that are correlated with each other undermines the model's performance.

I believe the list only shows the fields that the system excluded automatically, not the fields that users excluded by unchecking them. If that's the case, we should make that clear or, even better, fix the filter so that it applies to all included/excluded fields. cc @alvarezmelissa87
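
To illustrate the expected behavior, here is a minimal sketch of a filter that treats a field as "not included" whether the system excluded it or the user unchecked it. This is not Kibana's actual code: the `FieldRow` shape, the `isIncludedBySystem` and `isChecked` properties, the `filterRows` helper, and the sample fields other than Cancelled are all hypothetical names chosen for the example.

```typescript
// Hypothetical shape of a row in the field-selection table (not Kibana's real type).
interface FieldRow {
  name: string;
  // false when the system excluded the field automatically
  isIncludedBySystem: boolean;
  // false when the user unchecked the field in the table
  isChecked: boolean;
}

type InclusionFilter = 'included' | 'not_included';

// A field is effectively included only if the system allows it
// AND the user has left it checked.
function isEffectivelyIncluded(row: FieldRow): boolean {
  return row.isIncludedBySystem && row.isChecked;
}

// Apply the "is included" / "is not included" filter to all rows,
// regardless of who excluded the field.
function filterRows(rows: FieldRow[], filter: InclusionFilter): FieldRow[] {
  return rows.filter((row) =>
    filter === 'included' ? isEffectivelyIncluded(row) : !isEffectivelyIncluded(row)
  );
}

// Sample data (assumed for illustration): Cancelled was unchecked by the user,
// timestamp was excluded by the system.
const rows: FieldRow[] = [
  { name: 'Cancelled', isIncludedBySystem: true, isChecked: false },
  { name: 'DistanceKilometers', isIncludedBySystem: true, isChecked: true },
  { name: 'timestamp', isIncludedBySystem: false, isChecked: false },
];

const notIncluded = filterRows(rows, 'not_included').map((r) => r.name);
console.log(notIncluded);
```

With this logic, Cancelled would show up under "is not included" next to the automatically excluded fields, which is the behavior described above.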

elasticmachine commented 2 months ago

Pinging @elastic/ml-ui (:ml)