Pierre-Wallet commented 8 months ago

17 creation of metadata-creation.R to handle the building of metadatasets, their storage in data folder and their documentation skeleton in R/data.R. #17 creation of 10 metadatasets to reflect WHO website, with their documentation manually updated. #17 creation of snapshot tests for each metadatsets created

Thank you for your Pull Request! We have developed this task checklist from the Development Process Guide to help with the final steps of the process. Completing the below tasks helps to ensure our reviewers can maximize their time on your code as well as making sure the admiral codebase remains robust and consistent.

Please check off each taskbox as an acknowledgment that you completed the task or check off that it is not relevant to your Pull Request. This checklist is part of the Github Action workflows and the Pull Request will not be merged into the main branch until you have checked off each task.

[x] Place Closes # into the beginning of your Pull Request Title (Use Edit button in top-right if you need to update)
[x] Code is formatted according to the tidyverse style guide. Run styler::style_file() to style R and Rmd files
[x] Updated relevant unit tests or have written new unit tests, which should consider realistic data scenarios and edge cases, e.g. empty datasets, errors, boundary cases etc. - See Unit Test Guide
[x] If you removed/replaced any function and/or function parameters, did you fully follow the deprecation guidance?
[x] Update to all relevant roxygen headers and examples, including keywords and families. Refer to the categorization of functions to tag appropriate keyword/family.
[x] Run devtools::document() so all .Rd files in the man folder and the NAMESPACE file in the project root are updated appropriately
[x] Address any updates needed for vignettes and/or templates
[x] Update NEWS.md under the header # admiralpeds (development version) if the changes pertain to a user-facing function (i.e. it has an @export tag) or documentation aimed at users (rather than developers)
[x] Build admiralpeds site pkgdown::build_site() and check that all affected examples are displayed correctly and that all new functions occur on the Reference page.
[x] Address or fix all lintr warnings and errors - lintr::lint_package()
[x] Run R CMD check locally and address all errors and warnings - devtools::check()
[x] Link the issue in the Development Section on the right hand side.
[x] Address all merge conflicts and resolve appropriately
[x] Pat yourself on the back for a job well done! Much love to your accomplishment!

github-actions[bot] commented 8 months ago

Package	Line Rate	Health
admiralpeds	100%	✔
Summary	100% (5 / 5)	✔

zdz2101 commented 8 months ago

from #15 https://github.com/pharmaverse/admiralpeds/pull/16#discussion_r1517893616 we will want to adopt something of this nature

Pierre-Wallet commented 8 months ago

@zdz2101 , the update is done. Everything seems clear to me.

manciniedoardo commented 8 months ago

FYI that the data_raw folder is also being created in this PR

zdz2101 commented 8 months ago

@Pierre-Wallet can you take a look at: https://github.com/pharmaverse/admiralpeds/pull/16

What we'll do is instead of exporting each dataset, we write them into the sysdata.rda file to store them internally, as for what needs to be done here, can you:

Modify the data.R file and turn it into something like get_who_data.R that creates a function called get_who_data() that essentially is a wrapped switch statement that fetch the respective dataset that the user may need
remove each dataset .rda object
we probably will have to add a separate script in data-raw such that it runs both the cdc_metadata_creation.R and who_metadata_creation.R and at the bottom has one line that saves all the objects to sysdata, it'll look something like this:
```
source("cdc_metadata_creation.R")
source("who_metadata_creation.R")
usethis::use_data(cdc_wtage, cdc_htage, cdc_bmiage, {all your who_dataobjects}, overwrite = TRUE, internal = TRUE)
```
modify testthat files to call get_who_data() instead of calling the object directly

rossfarrugia commented 7 months ago

No problem @zdz2101 - sounds a good plan, and thanks for @Pierre-Wallet here for thorough review and input to guide us to the ideal strategy. For now please review each others PRs and merge them once ready and when I'm back I'll put a reminder to do some additional checks.

Pierre-Wallet commented 7 months ago

In terms of house-keeping though, can you separate the who_metadata_creation.R into its individual 10 files in the data-raw folder? Each individual dataset can have a corresponding file-to-rda object

If I understood correctly, you want 10 .R files, each of them building an .rda object instead of having the building of the ten datasets in one .R file as it currently is? To be more precise, do you want to :

keep who_metadata_creation which sources 10 sub programs
or do you want to have directly 10 programs in data-raw/?

One question is: do we give up the idea of having all the .rda in one internal .rda?

If yes with option 1, we could have it at the end of who_metadata_creation after the 10 subcalls.
if yes with option 2, we could have it in a dedicated program
if no, well each .rda will be built in each sub program.

zdz2101 commented 7 months ago

I like 1 the most but actually on second thought, that brings us back to the same problem with how to document all the objects since they won't exported to the namespace, can you just follow the structure pharmaversesdtm has it? So 1 dedicated program for each object and each dataset have its own .rda, so option 2 and abandoning the internal sysdata.rda object

Pierre-Wallet commented 7 months ago

I will test the options tomorrow. I think it is possible to get all datasets in one internal .rda and still have a documentation for each dataset. I keep you posted

Pierre-Wallet commented 7 months ago

Hi @zdz2101 , I just pushed my updates. Please have a look to it, I eventually created one pgm for each .rda with its own documentation. The only issue I have is a lintr one, which does not accept the usage of source().

pharmaverse / admiralpeds

closes #17 creation of data-raw folder to handle metadata creation #21