General Issue: Verify template results

pharmaverse / admiral

ADaM in R Asset Library

https://pharmaverse.github.io/admiral

Apache License 2.0

220 stars 62 forks source link

General Issue: Verify template results #2506

Open StefanThoma opened 6 days ago

StefanThoma commented 6 days ago

Background Information

It would be ideal to have a way to verify whether the outputs of the templates have changed.

One idea would be to write unit tests, so that running the unit tests would already check that. I think it could be done like this in a unit test:

Create new file using template
(Potentially) modify the new file to reduce the dataset size used
Source the file
Compare the newly created file to the one created last time using diffdf

Definition of Done

Either find a way of verifying the outputs of the templates, or to decide that this is not necessary.

bms63 commented 5 days ago

I would be hesitant to make these into unit tests as they are very big files to run.

The current CI check for the templates is a separate workflow. I was thinking we would build off that one - pharmaverse/admiralci/.github/workflows/check-templates.yml@main

StefanThoma commented 5 days ago

I see your point. The downside with a separate run is that you cannot run it locally, afaik.

bms63 commented 5 days ago

What about how Edoardo approached that new check in the blog? He sources a file in the workflow - maybe something to consider? Then you could grab the file and run it

StefanThoma commented 5 days ago

Yeah that would not be a bad idea. Let me experiment a bit.

bundfussr commented 5 days ago

The downside with a separate run is that you cannot run it locally, afaik.

We could implement a function which runs a specified list of templates and compares the results with the references datasets (in pharmaverseadam?).

This function could then be used in the CI/CD workflow and also be called locally.

StefanThoma commented 5 days ago

The downside with a separate run is that you cannot run it locally, afaik.

We could implement a function which runs a specified list of templates and compares the results with the references datasets (in pharmaverseadam?).

This function could then be used in the CI/CD workflow and also be called locally.

That sounds like a good idea.