Data4Democracy / drug-spending

Project to understand pharmaceutical spending, currently focused on US government programs.
73 stars 46 forks source link

Issue #61: drugbank+part_d_spending semijoin and antijoin #83

Closed proof-by-accident closed 6 years ago

proof-by-accident commented 6 years ago

semijoin = all rows in part_d_spending2011to2015.csv which had brand or generic names in the drugbank.xml "synonyms" or "brand names" fields

antijoin = all rows in part_d_spending2011to2015.csv which DID NOT have brand or generic names in the drugbank.xml "synonyms" or "brand names" fields

darya-akimova commented 6 years ago

Thanks for this, merging!