Closed mbrush closed 3 years ago
Next Steps:
Matt - The reason why ClinGen needs these population types is to allow them to support the ACMG guidelines when applying specific population allele frequency data as evidence for the criterion that requires pop allele frequency data to be met. The ACMG guidelines specifically references ExAC, 1000Genomes and ESP. If we do not have codes for the potentially referenced pop types then we would not be able to represent these 3 repos of pop allele frequency in our variant assessment. So, 1. is not just for ClinGen it is for any group that wants to apply the ACMG guidelines and do so while being able to represent the data in these 3 repos. Is that reasonable enough to cover #1 and #2 above? Or would you need to hear this from an ACMG representative. I have heard that ESP data is not necessarily as useful. I think this is since the emergence of gnomad and other sources of pop allele freq data. But, there may still be those that want to follow the ACMG recommendation by the book, so even if this is so, it makes sense to provide supporting codes for the population types that can be applied computationally for this data.
Thanks Larry - was mainly recording this ticket for the benefit of Alice Popejoy and Lyuba Remennik as context for my upcoming calls with them., where we will work to get these terms into the NCIt. In the meantime I will put placeholders in SEPIO, and keep you posted when/if we get NCIt replacements.
As part of our efforts to evolve SEPIO to support ClinGen variant interpretation data, we need classes for 'Population' types described in the gnomAD/Broad, IGSR, and ESP exome sequencing databases.
We would like to use the NCIt as the home for these classes, as the 'Population Group' hierarchy here seems broad enough to cover all population types defined in these systems based on race, ethnicity, and geography. Many terms will map to existing classes, but for many new terms would have to be created in the NCIt.
Lyuba Remennik from the NCIt's EVS has indicated that they are interested in expanding the representation of population groups in NCIt to support epidemiologic research as well as the looming Data Commons. So we can work with the NCIt and other efforts needing Population/Group value sets in this work.