Closed mikerobeson closed 1 year ago
I think this can also be achieved by using the NCBI Datasets (see #96), which I started working on a while ago (https://github.com/misialq/RESCRIPt/blob/ncbi-datasets/rescript/ncbi_datasets.py) - using a taxon ID as a query one can pull all assembled genomes, together with their metadata and taxonomies (all using the Datasets API, so no need to do much additional parsing as far as I remember)... Although, it does require some new semantic types defined in https://github.com/bokulich-lab/q2-types-genomics (for storing the genome annotations).
@misialq is this PR closed by #153 ?
Yes, it is!
Provide the ability for users to download RefSeq Genome Assemblies, along with their associated taxonomy. See this forum thread for more details.
Some notes:
(txid2[orgn] OR txid2157[orgn]) AND "latest_refseq"[Properties]