opensafely-core / opencodelists

OpenCodelists is an open platform for creating and sharing codelists of clinical terms and drugs.
https://www.opencodelists.org

Produce report on unknown codes in TPP database #1394

Closed inglesp closed 2 years ago

inglesp commented 2 years ago

For each of the following columns in TPP:

we want to know whether the column contains any codes that are not in the version of the underlying coding system that's currently being used in OpenCodelists.

For each column, we should produce a report with counts of unknown codes, and a graph of the number of unknown codes over time.
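
As a rough illustration of the "counts of unknown codes" part, the check is a set-membership test against the codes known to the current coding system release. This is a minimal sketch only: `count_unknown_codes`, the sample codes, and the in-memory lists are all hypothetical, and the real analysis would read from the backend rather than from Python lists.

```python
from collections import Counter


def count_unknown_codes(recorded_codes, known_codes):
    """Count how often each recorded code is absent from known_codes.

    recorded_codes: iterable of codes as they appear in one column (assumed input).
    known_codes: iterable of codes in the coding system release in use (assumed input).
    """
    known = set(known_codes)
    return Counter(code for code in recorded_codes if code not in known)


# Hypothetical example data, not real clinical codes.
known = {"A01", "B02", "C03"}
recorded = ["A01", "X99", "B02", "X99", "Y88"]
unknown_counts = count_unknown_codes(recorded, known)
```

Keeping the function independent of any particular database makes it easier to reuse the same check for other backends.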

This report should be generalisable to other backends in future.

inglesp commented 2 years ago

Blocked while we wait for clarity on whether we can use SQL Runner for this.

milanwiedemann commented 2 years ago

At this point, do we want to report/visualise each specific unknown code by month (e.g., 2022-01: Code123, Code456), or just the number of unknown codes per month (e.g., 2022-01: 2 unknown codes)?

inglesp commented 2 years ago

Could we do both?
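
Both views can come out of the same grouping step. The sketch below is an assumption-laden illustration, not the analysis that was eventually run: `unknown_codes_by_month` is a hypothetical helper, events are assumed to be `(ISO date string, code)` pairs, and the count is taken to mean distinct unknown codes per month.

```python
from collections import defaultdict


def unknown_codes_by_month(events, known_codes):
    """Group distinct unknown codes by month (YYYY-MM).

    events: iterable of (date, code) pairs, with date as an ISO "YYYY-MM-DD"
    string (assumed format).
    Returns, per month, the sorted unknown codes and how many there are.
    """
    known = set(known_codes)
    by_month = defaultdict(set)
    for date, code in events:
        if code not in known:
            by_month[date[:7]].add(code)  # first 7 characters are "YYYY-MM"
    return {
        month: {"codes": sorted(codes), "n_unknown": len(codes)}
        for month, codes in sorted(by_month.items())
    }


# Hypothetical example data mirroring the placeholder codes used above.
events = [
    ("2022-01-03", "Code123"),
    ("2022-01-17", "Code456"),
    ("2022-01-20", "Known1"),
    ("2022-02-01", "Code123"),
]
report = unknown_codes_by_month(events, known_codes={"Known1"})
```

The `codes` list gives the per-month detail, while `n_unknown` is the single number that could be plotted over time.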

inglesp commented 2 years ago

Analysis here: https://github.com/opensafely/unknown-codes-investigation/issues/4.