Adding support for Apache Arrow would greatly increase the interoperability between application developers and data scientists. https://arrow.apache.org/
It would be much more efficient to share data structures between dataframe-ec and many of the other dataframe like frameworks. It would also allow python users to move their work to Java for the more memory intensive operations.
Adding support for Apache Arrow would greatly increase the interoperability between application developers and data scientists. https://arrow.apache.org/
It would be much more efficient to share data structures between dataframe-ec and many of the other dataframe like frameworks. It would also allow python users to move their work to Java for the more memory intensive operations.
There is already a Java library for Arrow. https://arrow.apache.org/docs/java/ https://central.sonatype.com/search?q=g:org.apache.arrow%20%20v:17.0.0&smo=true
Thanks for the great work!