Closed domhauton closed 5 years ago
Hi,
I appreciate your attention to this!
I've always considered it poor form for a numeric utility to quietly coerce improper input like that. Slightly better would be to never switch out of discrete mode if the input is not a numeric type, but even this is another hidden assumption to deal with incorrect input. Cleanest to expect numeric data rather than guessing or coercing.
While processing a large data set (attached) the percentile calculations return an equation string.
By casting the centroid mean to a string we can prevent this issue.
Dataset: my_set.csv.zip
Test Code:
Result pre-fix:
Result post-fix: