Closed avsdev-cw closed 2 years ago
As per title:
cluster <- multidplyr::new_cluster(parallel::detectCores()) someFunc <- function(data, cluster) { data %>% dplyr::group_by(cyl) %>% multidplyr::partition(cluster) %>% dplyr::summarise(avg_mpg = mean(mpg)) %>% dplyr::collect() } someFunc(mtcars, cluster)
If you are lucky 2 or more cores (main+workers) will randomly get used, if not then only the main thread gets used.
+1, it seems there's a lot of data copying and other things going on in the background. Would love to see this resolved.
Duplicate of #87
As per title:
If you are lucky 2 or more cores (main+workers) will randomly get used, if not then only the main thread gets used.