Closed dhoogest closed 4 years ago
For record keeping these are the current formats:
https://www.ncbi.nlm.nih.gov/Sequin/acc.html
https://www.ncbi.nlm.nih.gov/books/NBK21091/table/ch18.T.refseq_accession_numbers_and_mole/?report=objectonly
Also keep in mind generated Refseq accessions do not follow the same rules as regular accessions. For example:
Has a Refseq prefix followed by only numerals.
For example,
NZ_CAADIT010000001
from accessionCAADIT010000001
is transformed by the regular expression in https://github.com/nhoffman/ya16sdb/blob/f429540104b6c151acd65a968a51446aa33fcd63/bin/extract_genbank.py#L26 toADIT01000000
. Results in duplicate records for this genome