timriffe / covid_age

COVerAGE-DB: COVID-19 cases, deaths, and tests by age and sex
Other
56 stars 30 forks source link

Update documentation for qualityMetrics and offset.csv #143

Closed eshom closed 2 years ago

eshom commented 2 years ago

Hi!

Knowing the format of datasets is important in order to be able to correctly read them, of course. There is datasets information here: https://timriffe.github.io/covid_age/GettingStarted.html

But it's outdated on two accounts:

  1. A relatively new dataset offsets.csv is missing from the list.
  2. qualityMetrics.zip doesn't have 18 columns anymore, but 23, what are their types?

Is it possible to update their information?

Also, note from which file version there was a column number update. This is important to be able to handle reading different dataset versions.

This information is necessary for both eshom/covid-age-data#6 and eshom/covid-age-data#7

Thanks!

timriffe commented 2 years ago

Thanks, and apologies for the late reply- I've updated the getting started guide to account for these changes. offsets.csv was there from the start, but we never considered it an output because we use it instrumentally only. But I've added it anyway. The columns for offsets are subject to change, as I need to add a Code variable for better joining to the main data. Also I'll change the order at that time. That particular dataset is subject only to sporadic changes. Thanks again!