Open ctb opened 1 month ago
This dataset is not in the index, likely because its metadata (abstract) mentions "amplicon":
... datasets of sequenced bacterial 16S rRNA gene amplicons and total fecal ...
https://trace.ncbi.nlm.nih.gov/Traces/index.html?view=run_browser&acc=SRR073439&display=metadata
which connects with the discussion in https://github.com/sourmash-bio/branchwater/issues/24#issuecomment-2067814713 =]
It's in wort,
/group/ctbrowngrp/irber/data/wort-data/wort-sra/sigs/SRR073439.sig
Yes, see footnotes 1 and 2 here: https://github.com/sourmash-bio/branchwater/issues/24#user-content-fn-1-16d83edf852b4e8c4fb59f87c826ec58 https://github.com/sourmash-bio/branchwater/issues/24#user-content-fn-2-16d83edf852b4e8c4fb59f87c826ec58
The accession BK010471 is for a crAssphage that is ubiquitous in human gut metagenomes (link), and in particular is found in the 454 data set SRR073439.
When I do a containment search, I see:
and the Venn diagram is pleasing:
However, the FASTA sequence does not have any matches when searched at https://branchwater.jgi.doe.gov/. Any ideas?
thanks!
SRR073439.k31.sig.zip BK010471.k31.sig.zip BK010471.fa.zip
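
For anyone following along, here is a minimal sketch of what "containment" means in the search mentioned above: the fraction of the query's k-mers that also occur in the other data set. This is a toy, exact k-mer version in plain Python and is not how sourmash or branchwater compute it (they work on FracMinHash sketches such as the attached k=31 signatures). The sequences, the small `k`, and all function names are made up for illustration.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase ACGT string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq))


def canonical_kmers(seq, k):
    """Collect the canonical k-mers of seq (the smaller of each k-mer and its
    reverse complement), skipping k-mers with non-ACGT characters."""
    seq = seq.upper()
    kmers = set()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            kmers.add(min(kmer, reverse_complement(kmer)))
    return kmers


def containment(query, subject):
    """Fraction of the query k-mer set that is also present in subject."""
    if not query:
        return 0.0
    return len(query & subject) / len(query)


# Hypothetical toy data: a short "genome" and a "metagenome" that contains
# only part of it. Real searches would use k=31 on the actual sequences.
genome = "ACGTTGCAAGGCTTAACCGGTA"
metagenome = "TTT" + genome[:15] + "GGGGG"
k = 5

q = canonical_kmers(genome, k)
s = canonical_kmers(metagenome, k)
print(f"containment of genome in metagenome: {containment(q, s):.2f}")
```

Note that containment is asymmetric: `containment(q, s)` and `containment(s, q)` generally differ, which is why a small genome can be fully contained in a large metagenome while the two are very dissimilar overall.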