Kotlin / dataframe

Structured data processing in Kotlin
https://kotlin.github.io/dataframe/overview.html
Apache License 2.0
772 stars 48 forks source link

Apache Feather Categorical Data Type like in Pandas #591

Open StartedFromTheButton opened 4 months ago

StartedFromTheButton commented 4 months ago

Hey I would like to save my enums as Categorical Values in an Apache Feather file.

Jolanrensen commented 4 months ago

Hi, our Arrow writer doesn't currently support such a type, but you could try extend it with a proof of concept yourself to see if it's possible at all :)

StartedFromTheButton commented 4 months ago

@Jolanrensen I'd love to. What is the current contributor situation for this project? Is this a JetBrains employees only project?

Jolanrensen commented 4 months ago

@StartedFromTheButton Anyone is free to join :) All the help is welcome, as seen in the contribution guide. Of course, the maintainers have the final say in what to merge and when, but anyone can make issues or PRs :).

Jolanrensen commented 4 months ago

Slightly relevant: We recommend users to use https://github.com/Kotlin/dataframe/blob/master/core/src/main/kotlin/org/jetbrains/kotlinx/dataframe/api/enum.kt whenever they use enums for easier String->Enum and Enum->String conversions in DF. If it's relevant for you, try to incorporate it :).