FINNGEN / kanta_lab_preprocessing

Repo for kanta lab QC
MIT License
1 stars 0 forks source link

Improving column names #15

Closed vincent-octo closed 1 month ago

vincent-octo commented 5 months ago

We could discuss about changing the column names, as in my opinion some are a bit confusing at the moment. The idea is to be more consistent in the naming, and avoid implicitness.

The suggested renaming would be based on — and differentiate between — these concepts:

This would mean we will not have the same column names as in Kira's output, but that's OK.

Suggested column renaming:

So in the end, we would change from this header:

FINREGISTRYID   LAB_DATE_TIME   LAB_SERVICE_PROVIDER    LAB_ID  LAB_ID_SOURCE   LAB_ABBREVIATION    LAB_VALUE   LAB_UNIT    LAB_ABNORMALITY REFERENCE_VALUE_TEXT    MEASUREMENT_STATUS

to this header:

FINREGISTRYID   MEASUREMENT_DATE_TIME   SERVICE_PROVIDER    TEST_ID TEST_ID_SOURCE  TEST_NAME_ABBREVIATION  MEASUREMENT_VALUE   MEASUREMENT_UNIT    RESULT_ABNORMALITY  TEST_REFERENCE_VALUE    RESULT_STATUS
piotor87 commented 4 months ago

Just one thing: for consistency with the input columns, I kept MEASUREMENT_STATUS because in the original column is grouped with the other ones:

'tutkimustulosarvo':              'MEASUREMENT_VALUE',
'tutkimustulosyksikko':           'MEASUREMENT_UNIT',
'tutkimusvastauksentila':         'MEASUREMENT_STATUS',
'tuloksenpoikkeavuus':            'RESULT_ABNORMALITY',