maelstrom-research / Rmonize

3 stars 0 forks source link

Summary report updates: Harmonized dataset summary report > Variables summary (all) #86

Open twey2 opened 2 months ago

twey2 commented 2 months ago

Harmonized dataset summary report > Variables summary (all)

1) Column titles and order should be: Index Grouping variable: adm_study_id Variable name Variable label Mlstr_harmo::status Quality assessment comment Data dictionary valueType Dataset valueType Suggested valueType Categorical variable Categories in data dictionary Number of rows Number of valid values Number of non-valid values Number of empty values % Valid values % Non-valid values % Empty values Number of distinct values

2) ‘Grouping variable: adm_study_id’ is moved to column B (after ‘Index’). 3) ‘Mlstr_harmo::status’ is added as column E (after ‘Variable label’). 4) For columns “Number of rows” to “% Empty values”: Validate carefully how they are calculated and that they match the updated column titles.

See attached mock-up file for reference. summary_report_harmo_validated.xlsx

GuiFabre commented 1 month ago

@twey2

Add the harmo status is not strait forward, I will find a solution. The rest is ok to be tested update : a solution was found, to be tested :)