cardillo / joinery

Data frames for Java
https://joinery.sh
GNU General Public License v3.0
700 stars 166 forks source link

sorting / grouping implementation efficiency improvements #40

Open cardillo opened 9 years ago

cardillo commented 9 years ago

such as avoiding copies, making efficient use of memory, parallelization, etc.

cardillo commented 9 years ago

see http://wesmckinney.com/blog/mastering-high-performance-data-algorithms-i-group-by/ for some insight into how this is done in pandas.