We have our tab data spread across multiple files. However, the files have some overlap in the columns that they include. Currently, minimum_data_R() combines data from multiple files with a dplyr::full_join(), which causes duplicate columns to be suffixed with .x and .y respectively. As a result, the columns are essentially missing downstream, where such suffixes aren't expected.
Could minimum_data_R() be updated to be a bit smarter about this, and in the presence of duplicate columns for example use the one from the file specified last in the list?
We have our tab data spread across multiple files. However, the files have some overlap in the columns that they include. Currently,
minimum_data_R()
combines data from multiple files with adplyr::full_join()
, which causes duplicate columns to be suffixed with.x
and.y
respectively. As a result, the columns are essentially missing downstream, where such suffixes aren't expected.Could
minimum_data_R()
be updated to be a bit smarter about this, and in the presence of duplicate columns for example use the one from the file specified last in the list?