coderxio / sagerx

Open drug data pipelines curated by pharmacists.
https://coderx.io/sagerx

Parse full description from ASHP shortage detail page #325

Open jrlegrand opened 1 month ago

jrlegrand commented 1 month ago

Problem Statement

In addition to extracting the NDC from each bullet point on the ASHP shortage detail page, also capture the bullet's full text and associate it with that NDC.

Some components, such as pack size, would be useful to extract by splitting the text on commas so they can be compared against pack sizes from other sources.
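A minimal sketch of that idea, using only the standard library. The sample bullet text, the NDC regular expression, and the `parse_bullet` helper are assumptions made for illustration; the real ASHP wording and the existing sagerx code may differ.

```python
import re

# Hypothetical bullet text; the actual ASHP page wording may differ.
SAMPLE = "Cisplatin injection, Teva, 1 mg/mL, 50 mL vial, 1 count, NDC 00703-5747-11"

# Assumed pattern: the literal "NDC" followed by a hyphenated three-segment code.
NDC_PATTERN = re.compile(r"\bNDC\s+(\d{4,5}-\d{3,4}-\d{1,2})\b")


def parse_bullet(text):
    """Return the NDC, the full description, and the comma-separated parts."""
    match = NDC_PATTERN.search(text)
    ndc = match.group(1) if match else None
    # "Explode" the description on commas, dropping empty pieces.
    parts = [piece.strip() for piece in text.split(",") if piece.strip()]
    return {"ndc": ndc, "description": text.strip(), "parts": parts}


result = parse_bullet(SAMPLE)
print(result["ndc"])
print(result["parts"])
```

Keeping the full description alongside the split parts means a later step can decide which piece (for example the vial size or count) is the pack size without re-scraping the page.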


Criteria for Success

For both affected and available NDCs, add a column to the NDC model that holds this full text (named "description" or similar).

Additional Information

This requires a change to the BeautifulSoup (bs4) scraping code, plus an additional column in the dbt model.
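As a rough sketch of the bs4 side, the snippet below collects one row per list item, tagged as affected or available, with the full description next to the NDC. The HTML layout (an `h3` heading followed by a `ul`), the placeholder drug data, the `extract_rows` helper, and the `ndc_type` field name are all assumptions for illustration, not the actual ASHP markup or the existing sagerx code.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical markup with made-up products; the real detail page may be structured differently.
HTML = """
<h3>Products Affected</h3>
<ul><li>Drug A injection, Maker X, 10 mg/mL, 5 mL vial, 10 count, NDC 11111-2222-33</li></ul>
<h3>Products Available</h3>
<ul><li>Drug A injection, Maker Y, 10 mg/mL, 5 mL vial, 25 count, NDC 44444-5555-66</li></ul>
"""

# Assumed pattern: the literal "NDC" followed by a hyphenated three-segment code.
NDC_PATTERN = re.compile(r"\bNDC\s+(\d{4,5}-\d{3,4}-\d{1,2})\b")


def extract_rows(html):
    """Return one dict per bullet: NDC, affected/available flag, full text."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for heading in soup.find_all("h3"):
        bullet_list = heading.find_next_sibling("ul")
        if bullet_list is None:
            continue
        ndc_type = "affected" if "Affected" in heading.get_text() else "available"
        for item in bullet_list.find_all("li"):
            description = item.get_text(" ", strip=True)
            match = NDC_PATTERN.search(description)
            if match:  # skip bullets that carry no NDC
                rows.append(
                    {"ndc": match.group(1), "ndc_type": ndc_type, "description": description}
                )
    return rows


for row in extract_rows(HTML):
    print(row)
```

Each returned dict could then be loaded as a raw row, leaving the dbt model to expose `description` as the additional column.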