Just realizing that the OOPD file has multiple duplicate drug-disease pairs. For example, there are four lines that indicate that ibrutinib treats Chronic lymphocytic leukemia (which differ based on approval dates). Currently the regression test treats them as four independent tests, and ideally we'd do a pre-filter step to only take unique drug-disease pairs.
Just realizing that the OOPD file has multiple duplicate drug-disease pairs. For example, there are four lines that indicate that
ibrutinib
treatsChronic lymphocytic leukemia
(which differ based on approval dates). Currently the regression test treats them as four independent tests, and ideally we'd do a pre-filter step to only take unique drug-disease pairs.