snayfach / UHGV

Unified Human Gut Virome Catalog
https://portal.nersc.gov/UHGV
Other
27 stars 1 forks source link

Host prediction #12

Closed snayfach closed 1 year ago

snayfach commented 1 year ago

Hadza identifiers don't match master table: Nepal_MoBio_Fiber-Hadza-Nepal_B_5_THA1056YZ.71 CRISPR spacers to all 3 databases kmers to only 2/3 databases

MySQL tables

To Do

Genome info counts (mysql = host_genomes):

host_taxonomy.tsv file: looks good!

Do I have hits to UHGG genomes absent from list?

snayfach commented 1 year ago

Reuse text from IMG/VR: https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkac1037/6833254