Closed mitokic closed 1 year ago
Investigate how to leverage tools like dtplyr , arrow, and sparklyr packages to do larger data manipulation within the package.
Could create standard functions for mutate, select, group by, summarize that call different packages based on the hist data size.
https://www.business-science.io/code-tools/2019/08/15/big-data-dtplyr.html https://arrow.apache.org/docs/r/articles/dataset.html https://spark.rstudio.com/
Investigate how to leverage tools like dtplyr , arrow, and sparklyr packages to do larger data manipulation within the package.
Could create standard functions for mutate, select, group by, summarize that call different packages based on the hist data size.
https://www.business-science.io/code-tools/2019/08/15/big-data-dtplyr.html https://arrow.apache.org/docs/r/articles/dataset.html https://spark.rstudio.com/