biobakery / humann

HUMAnN is the next generation of HUMAnN 1.0 (HMP Unified Metabolic Analysis Network).
http://huttenhower.sph.harvard.edu/humann
Other
170 stars 58 forks source link

about retrieving the sequences(or representive sequences) of the gene families #53

Closed IRSADFERHAT closed 1 year ago

IRSADFERHAT commented 1 year ago

Sir/Madam, hi. I am trying to retrieve the sequences or/and representative sequences of the uniref90 gene families and species in the samplesgenefamilies.tsv output file from humann3. I found that the uniref90 accession names in the files are not in the uniref90 database. I found that these names are constructed in the following way: 'Uniref90' + UniProtKB accession number. Such as these ones: UniRef90_C6JF93; UniRef90_R7PFX; UniRef90D4LFR6. I am uncertain what to do now. Should I just ignore the 'Uniref90' part in the name and query the sequences only using the following UniProtKB accession numbers? And can I retrieve the sequences of the specific species within the same family? Best regard Irsat

github-actions[bot] commented 1 year ago

Thank you for creating this issue. We currently field issues through our bioBakery Discourse Support Forum. If you would please post the issue to discourse we would be happy to sync up with you to get it resolved.