coderxio / sagerx

Open drug data pipelines curated by pharmacists.
https://coderx.io/sagerx

Modify get_files_in_data_folder to account for files within unzipped files #304

Open jrlegrand opened 4 months ago

jrlegrand commented 4 months ago

Replace FDA UNII get_file_name function with standard get_files_in_data_folder + filter

          You can use the existing get_files_in_data_folder function to pull all of the downloaded data files. It returns a list, so you can iterate through it here in the DAG to find the match.

_Originally posted by @lprzychodzien in https://github.com/coderxio/sagerx/pull/300#discussion_r1664623979_
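
For illustration, here is a minimal sketch of the "iterate through the list to find the match" idea from the quoted suggestion. It assumes get_files_in_data_folder returns a list of file name strings. The helper name `find_matching_file`, the glob pattern, and the example file names are hypothetical and not taken from the sagerx codebase.

```python
from fnmatch import fnmatchcase


def find_matching_file(file_names, pattern):
    """Return the first file name matching a glob pattern, or None.

    `file_names` stands in for the list that get_files_in_data_folder
    is assumed to return; this helper is hypothetical.
    """
    for name in file_names:
        # fnmatchcase compares case-sensitively on every platform
        if fnmatchcase(name, pattern):
            return name
    return None


# Hypothetical file names, for illustration only
example_files = ["README.txt", "UNII_Names_example.txt", "UNII_Records_example.txt"]
match = find_matching_file(example_files, "UNII_Records*.txt")
```

Returning None when nothing matches lets the DAG decide whether a missing file should fail the task or be skipped.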

jrlegrand commented 4 months ago

Note @lprzychodzien - this solution probably won't work without modifying get_files_in_data_folder to account for zip files... which we can do. I just thought it would be an easy fix, and it doesn't look that way.
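
One possible shape for that modification, as a rough sketch only: it assumes zip archives are extracted into subfolders of the data folder, so the listing has to descend into them. The function name `list_files_recursive` and its parameter are hypothetical; the actual get_files_in_data_folder signature may differ.

```python
from pathlib import Path


def list_files_recursive(data_folder):
    """List every file under data_folder, including files inside
    subfolders such as extracted zip contents.

    Hypothetical stand-in for a modified get_files_in_data_folder.
    Returns sorted paths relative to data_folder, using forward slashes.
    """
    root = Path(data_folder)
    return sorted(
        path.relative_to(root).as_posix()
        for path in root.rglob("*")  # rglob descends into all subfolders
        if path.is_file()            # skip the directories themselves
    )
```

Returning relative paths keeps the result usable with the same kind of filtering as a flat list, while still showing which subfolder each file came from.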
