Fabiola-Eto / MULTIPLY-Initiative

This repository hosts codelists of long-term conditions addressed in a multimorbidity study developed by researchers from the Queen Mary University of London, Welcome Trust Sanger Institute and the London School of Hygiene and Tropical Medicine.
MIT License
7 stars 3 forks source link

Diabetes + diabetes complication codelist revisions #195

Closed finersarah closed 3 years ago

finersarah commented 3 years ago

Revision process:

@mmah-sh has collated all previous codelists for specification of diabetes type using EHR data, including CALIBER, OpenSAFELY, and others to try and refine our diabetes codelists. We will use this for further clinical curation, as follows:

  1. Clinical revision of diabetes type - important due to the number of diabetes codes that do not specify type. This refers to both diabetes diagnosis and complication codes:
    • I have allocated all codes to type 1 or type 2 diabetes if type 1 or type 2 specified in the clinical code
    • If a high probability of diabetes type can be inferred from code description (e.g. young onset diabetes with ketoacidosis) I have allocated to that type
    • If diabetes type not specified but clinical code indicates non-insulin-treated diabetes, I have allocated to type 2 diabetes

This leaves a large list which includes:

Next steps: @mmah-sh to further curate the unspecified diabetes types by doing the following: (a) Calculate the frequency of use of all codes in the unspecified/rare list (b) For those in the unspecified/rare list that are marked "needs revision", calculate the frequency of which an unspecific disease only code is superseded (i.e. a later date) by a specific disease only or complication code (c) For those in the unspecified/rare list that are marked "needs revision", calculate the frequency of which an unspecific complication code is preceded (i.e. an earlier date) by a specific disease only code

  1. Clinical revision of diabetes diagnosis vs complication lists
    • we still do not know if the codes which describe both a diabetes diagnosis and complication are also accompanied by a diagnosis only code. Currently, we duplicate codes across multiple lists which will overestimate prevalence.

Next steps: @mmah-sh to calculate the following for all complication codes: (a) Calculate the frequency of use of all complication codes (b) Calculate the frequency where a complication code and diagnosis only code co-occur (at any point in the EHR).

Fabiola-Eto commented 3 years ago

As discussed with @finersarah the Diabetes codelist (CPRD_Diabetes.csv, n= 390) will be split into three codelists:

The codelists were updated and uploaded to the branch codesets-included-conditions.

Fabiola-Eto commented 3 years ago

NB in CPRD GOLD data there are some patients with diabetes (n = 242,237) that received different combinations of diabetes diagnosis over time:

In agreement with @finersarah and @Miriam-S-git we decided that: