Kotlin / dataframe

Structured data processing in Kotlin
https://kotlin.github.io/dataframe/overview.html
Apache License 2.0

Describe breaks on `Number` column (and other statistics inconsistencies) #558

Open Jolanrensen opened 6 months ago

Jolanrensen commented 6 months ago

This happens because the `Iterable<Number>.std()` function accepts `Number` values but doesn't convert them to `Double` (like `mean()` does).
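To illustrate the conversion step, here is a minimal sketch rather than the library's actual code. The name `stdAsDouble` is hypothetical, and the choice of the sample formula (dividing by n - 1) is an assumption made only for this example.

```kotlin
import kotlin.math.sqrt

// Hypothetical helper, not part of the dataframe API.
// Every Number is converted to Double up front, so mixed Int / Long / Float /
// Double input is handled uniformly.
fun Iterable<Number>.stdAsDouble(): Double {
    val values = map { it.toDouble() }
    // Assumption: sample standard deviation, which is undefined for fewer than two values.
    if (values.size < 2) return Double.NaN
    val mean = values.average()
    val sumOfSquares = values.sumOf { (it - mean) * (it - mean) }
    return sqrt(sumOfSquares / (values.size - 1))
}
```

With this sketch, a column typed as `List<Number>` that mixes `1`, `2.0f` and `3L` goes through the same `Double` code path as a plain `List<Double>`.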

There are a couple more missing actually:

All of them also lack `BigInteger` support, even though we support `BigDecimal`.

koperagen commented 6 months ago

https://github.com/Kotlin/dataframe/issues/352 is probably the same problem.

Jolanrensen commented 6 months ago

As mentioned in https://github.com/Kotlin/dataframe/issues/543, some functions, such as `median(ints)`, might return an unexpectedly rounded `Int`. It might be better to let all functions return `Double` and handle `BigInteger` / `BigDecimal` separately for now, since they're Java-specific.
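A rough sketch of what "always return `Double`" could look like for the median. The name `medianAsDouble` is hypothetical and this is not the library's implementation. For comparison, plain integer arithmetic such as `(1 + 2) / 2` evaluates to `1` in Kotlin, which is the kind of rounding described above.

```kotlin
// Hypothetical helper, not part of the dataframe API.
// Converts to Double before averaging the two middle elements, so an
// even-sized Int input does not lose its fractional part.
fun Iterable<Number>.medianAsDouble(): Double {
    val sorted = map { it.toDouble() }.sorted()
    if (sorted.isEmpty()) return Double.NaN
    val mid = sorted.size / 2
    return if (sorted.size % 2 == 1) {
        sorted[mid]
    } else {
        (sorted[mid - 1] + sorted[mid]) / 2.0
    }
}
```

Under these assumptions, `listOf(1, 2).medianAsDouble()` gives `1.5`. `BigInteger` and `BigDecimal` are deliberately left out of the sketch, because `toDouble()` can lose precision for them.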

zaleslaw commented 3 months ago

This looks like an umbrella ticket and should be split into smaller tasks.