guidelines_root_sample = PMIDs for guidelines we are evaluating in study. guidelines_citation = Col 1: Guideline PMID ; Col 2: Cited PMID (column one is key to guidelines_root_sample file) datacollection = metadata we want to collect for each guideline PMID and each cited PMID