tidyverse / multidplyr

A dplyr backend that partitions a data frame over multiple processes
https://multidplyr.tidyverse.org
Other
641 stars 75 forks source link

output to separate files on each cluster #53

Closed ChiWPak closed 7 years ago

ChiWPak commented 7 years ago

Not an issue but I'd like to output data in party_df objects to separate files on each cluster. With data.frames, I can do this: library(readr) data %>% write_tsv( ., file ) But write_tsv is not compatible with party_df. Anyone get this to work?

Ax3man commented 7 years ago

Use something like mtcars %>% partition(cyl) %>% do(readr::write_tsv(., paste0(.$cyl[1], '.tsv'))).

These kinds of questions may be better suited for stackoverflow.

ChiWPak commented 7 years ago

Thanks for the response. I'm closing this issue. (Sorry for posting here. Just didn't see a lot of posts on multidplyr on stackoverflow...)