human-pangenomics / hpgp-data

Data from the Human PanGenomics Project
Other
60 stars 4 forks source link

Information about the samples #12

Open Paula27222 opened 2 years ago

Paula27222 commented 2 years ago

Hello, I would like to ask a bit of information about the samples that are in this repository. Are they control samples? And secondly, I found this repository from the Pangenomics human project. https://s3-us-west-2.amazonaws.com/human-pangenomics/index.html?prefix=submissions/ What exactly are these samples?

Thanks in advanced. Best, Paula

skoren commented 2 years ago

All the samples are from the 1000 genome projects and are also available in coriel. For example: https://www.internationalgenome.org/data-portal/sample/HG01109 https://www.coriell.org/0/Sections/Search/Sample_Detail.aspx?Ref=HG01109&PgId=166

These 10 were selected to capture common alleles not in GRCh38 and were a sort of pilot for the larger HPGP project (see also HPRC_PLUS). The repository you pointed to includes all the HPGP samples which are described here: https://github.com/human-pangenomics/HPP_Year1_Data_Freeze_v1.0