karafecho closed 1 year ago
Update from new run, 08.17.2022, 2016 and 2020 datasets (v3, Hong's second run):
This issue is to report bugs that were identified during testing of the ICEES PCD datasets.
Update from new run, 08.4.2022, 2016 and 2022 datasets (v4, Hong's third run):
JAN2023 tests of v6 datasets:
Note that I ran a variety of summary statistics on the datasets for years 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, and 2021. I looked at key demographics, exposures, visits, diagnoses, and meds within each dataset. My tests were semi-systematic.
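For anyone who wants to repeat one of these checks, here is a minimal sketch of how "empty" variables could be flagged in a yearly CSV. It is not the script I ran; the helper name `empty_columns` and the sample column names are hypothetical, and it assumes an "empty" variable is one whose cells are all blank.

```python
import csv
import io


def empty_columns(rows):
    """Return the names of columns whose values are blank in every row."""
    rows = list(rows)
    if not rows:
        return []
    return [
        col
        for col in rows[0].keys()
        if all((row.get(col) or "").strip() == "" for row in rows)
    ]


# Hypothetical sample data; a real file would be read with
# open(path, newline="") instead of io.StringIO.
sample = io.StringIO(
    "PatientId,Sex,AvgDailyPM2.5Exposure\n"
    "1,Male,\n"
    "2,Female,\n"
)
print(empty_columns(csv.DictReader(sample)))
```

Running this over each year's file would give a quick list of the variables to discuss before redeployment.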
In sum, @hyi and @maximusunc, I think we're set to move forward with new ICEES+ and ICEES KG PCD deployments, but only after a decision is made re (1) and (4). If the "empty" variables show up as "null" in the APIs, then I think we should be fine, but I'd prefer to get your input before making a decision.
I split Confirmed_Dx into three variables, Confirmed_CF_Dx, Confirmed_IdiopathicBronchiectasisDx, and Confirmed_PCD_Dx, corresponding to those in the all_features YAML. I then copied the files to hop.renci.org at /projects/ebcr/pcd/data/patient/v6_rev_csv_files.
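A rough sketch of what a split like this could look like, for reference only. The encoding is an assumption: it supposes Confirmed_Dx holds a single label per patient ("CF", "IdiopathicBronchiectasis", or "PCD") and that the new variables are 0/1 indicators. The actual values in the datasets and in the all_features YAML may differ, and `split_confirmed_dx` and `DX_COLUMNS` are hypothetical names.

```python
# Assumed mapping from a Confirmed_Dx label to the new variable name.
# The labels on the left are illustrative, not taken from the datasets.
DX_COLUMNS = {
    "CF": "Confirmed_CF_Dx",
    "IdiopathicBronchiectasis": "Confirmed_IdiopathicBronchiectasisDx",
    "PCD": "Confirmed_PCD_Dx",
}


def split_confirmed_dx(row, mapping=DX_COLUMNS):
    """Return a copy of row with Confirmed_Dx replaced by three 0/1 columns."""
    out = {key: value for key, value in row.items() if key != "Confirmed_Dx"}
    dx = (row.get("Confirmed_Dx") or "").strip()
    for label, column in mapping.items():
        out[column] = 1 if dx == label else 0
    return out


# Hypothetical example row.
print(split_confirmed_dx({"PatientId": "1", "Confirmed_Dx": "PCD"}))
```

A blank or unrecognized Confirmed_Dx value yields 0 in all three columns under this sketch; whether that or a null is the right behavior depends on how the APIs treat missing values.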
@maximusunc : The new v6 PCD datasets are located on hop.renci.org at /projects/ebcr/pcd/data/patient/v6_rev_csv_files.
@hyi : ICEES+ PCD is now ready for redeployment with the new datasets.
Closing as this is complete ...
This issue is to report bugs that were identified during testing of the ICEES PCD datasets (v2, Hong's first run).