waldronlab / curatedMetagenomicDataCuration

Sample Metadata Curation for curatedMetagenomicData
https://waldronlab.io/curatedMetagenomicDataCuration/
28 stars 23 forks source link

XieH_2016 data, data accession number differ & disease status confused #58

Closed luzhang321 closed 2 years ago

luzhang321 commented 2 years ago

Hi

Sorry to bother you. I am confused with XieH_2016 dataset. In its related paper: "Shotgun Metagenomics of 250 Adult Twins Reveals Genetic and Environmental Impacts on the Gut Microbiome". It is mentioned " DATA AND SOFTWARE AVAILABILITY The accession number for metagenomic shotgun sequencing data for all 250 samples after removal of human sequences reported in this paper is European Bioinformatic Institute (EBI): ERP010708. Other relevant data have been deposited to the GigaScience Database (GigaDB) (http://dx.doi.org/10.5524/100253)." https://www.ebi.ac.uk/ena/browser/view/PRJEB9584?show=reads This data file only have 211 records. I checked the record in cMD sampleMetadata, I found it recorded another accesion number : ERP010700 https://www.ebi.ac.uk/ena/browser/view/PRJEB9576?show=reads In this data file, it has 250 records. Why they're different?

Another one is about the disease status recorded. In the sampleMetadata, it has disease "migraine;asthma" recorded. for example in subject_id YSZC12003_35387. Where is the status recorded in paper? I searched the supp.table1 in this paper : Table S1. The TwinsUK Cohort for Metagenomic Sequencing, Related to Figure 1 (A) Phenotypic information for the twins But I couldn't find neither migraine nor asthma. another example is subject_id: YSZC12003_35365, it is recorded in column "Has a doctor ever diagnosed or treated you for any of the following conditions? \ Diabetes" as Yes. Why this diabetes is not included in the sampleMetadata?

Thanks a lot!

Looking forward to your reply!

lwaldron commented 2 years ago

pinging @paolinomanghi

paolinomanghi commented 2 years ago

Hi. Data on asthma were taken from https://pubmed.ncbi.nlm.nih.gov/30208875/ together with other metadata. Is the same cohort. Data on migraine were taken from https://pubmed.ncbi.nlm.nih.gov/32083024/.