ga4gh / ga4gh-schemas

Models and APIs for Genomic data. RETIRED 2018-01-24
http://ga4gh.org
Apache License 2.0
214 stars 114 forks source link

Individual region/ethnicity modeling #810

Open mcourtot opened 7 years ago

mcourtot commented 7 years ago

@david4096 provided an example from interchange of the Populations field of the 1kgenomes data:

                "Population": [
                    "GBR"
                ], 

This seems a little bit unclear as we don't know whether it refers to the collection of the sample or the country of origin of the individual or reflects ethnicity of the individual.

Based on experience in the GWAS catalog, it may be useful to consider having:

BROAD ANCESTRAL CATEGORY: Broad ancestral category to which the individuals in the sample belong
COUNTRY OF ORIGIN: Country of origin of the individuals in the sample
COUNTRY OF RECRUITMENT: Country of recruitment of the individuals in the sample
ADDITONAL ANCESTRY DESCRIPTION: Any other information relevant to the sample description

The Ancestro could be used to formalise those fields.

cc @mbaudis

mcourtot commented 7 years ago

see also https://github.com/EBISPOT/ancestro/issues/2 https://github.com/EBISPOT/ancestro/issues/3