coderxio / sagerx

Open drug data pipelines curated by pharmacists.
https://coderx.io/sagerx
Other
45 stars 12 forks source link

New mart: Products to inactive ingredients (excipients) #300

Closed jrlegrand closed 3 months ago

jrlegrand commented 3 months ago

Resolves #296 Resolves #276

Explanation

Rationale

Removed duplicates for data quality / granularity.

Added preservative flag to facilitate people searching for products that contain preservatives. Can SUM this column in aggregate to identify products that contain any preservatives.

Names of excipients are not normalized in RxNorm. Oftentimes, it is just a matter of different capitalization that can be solved by lowercasing everything, but other times there are extra words or formatting differences. Normalizing the names is a better user experience and potentially better for reporting.

Tests

Ran mart and explored data in pgadmin.

jrlegrand commented 3 months ago

See attached file for excipients flagged as preservatives (per the USP methodology) that are in the mart at least once, with counts for how frequently they appear. Also has a link to PubChem to demonstrate this functionality and make it easier to review. Note: one missing PubChem link due to how the data exists from FDA UNII file.

preservatives_for_review.xlsx

jrlegrand commented 3 months ago

image