Bioconductor / GenomeInfoDb

Utilities for manipulating chromosome names, including modifying them to follow a particular naming style
https://bioconductor.org/packages/GenomeInfoDb

Registration request for Acyrthosiphon pisum NCBI assembly #36

Closed Scottyzy closed 2 years ago

Scottyzy commented 2 years ago

Hi, I'd like to have the genome for Acyrthosiphon pisum registered for the purpose of forging a BSgenome package. The assembly is pea_aphid_22Mar2018_4r6ur and below is the link to the NCBI page:

https://www.ncbi.nlm.nih.gov/assembly/GCF_005508785.1

Let me know if there's anything I can clear up or help with. Thank you very much.

hpages commented 2 years ago

Done in GenomeInfoDb 1.31.2 (see commit 513a854cb990dc6229f86af29f0fbc389bcb2a7f).

Note that whoever submitted the pea_aphid_22Mar2018_4r6ur assembly to NCBI messed it up:

The Acyr_2.0 assembly from the Baylor College of Medicine seems cleaner (but is also a few years older). Also registered in GenomeInfoDb:

Seqinfo(genome="Acyr_2.0")
# Seqinfo object with 23925 sequences (1 circular) from Acyr_2.0 genome:
#   seqnames      seqlengths isCircular   genome
#   MT                 16971       TRUE Acyr_2.0
#   Scaffold1        2622152      FALSE Acyr_2.0
#   Scaffold2        2384549      FALSE Acyr_2.0
#   Scaffold3        1975979      FALSE Acyr_2.0
#   Scaffold4        2478080      FALSE Acyr_2.0
#   ...                  ...        ...      ...
#   Scaffold23920        605      FALSE Acyr_2.0
#   Scaffold23921        430      FALSE Acyr_2.0
#   Scaffold23922       1480      FALSE Acyr_2.0
#   Scaffold23923        812      FALSE Acyr_2.0
#   Scaffold23924        683      FALSE Acyr_2.0

H.

Scottyzy commented 2 years ago

Hi, I was wondering how long it will be before the assembly is available to use. When I tried to run my seed files, I got an error saying it is not a registered NCBI assembly. Thank you.

hpages commented 2 years ago

Well, you need GenomeInfoDb 1.31.2. You can install it now directly from GitHub, or wait until it becomes available in BioC 3.15 (this is BioC devel, which requires R 4.2), at which point you'll be able to install it via BiocManager::install(). This should take between 24h and 48h.
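
For anyone who wants the GitHub route, here is a minimal sketch of how it might look. It assumes the package lives in the Bioconductor/GenomeInfoDb repository on GitHub and that BiocManager hands GitHub installs to the remotes package. The helper names needs_upgrade() and ensure_genomeinfodb() are made up for illustration and are not part of GenomeInfoDb.

```r
# Minimum GenomeInfoDb version that registers the assembly (per the reply above).
required_version <- "1.31.2"

# Hypothetical helper: TRUE if the given version string is older than required.
needs_upgrade <- function(installed, required = required_version) {
  package_version(installed) < package_version(required)
}

# Hypothetical helper: install from GitHub only if the installed version is too old.
# Nothing is installed until you call ensure_genomeinfodb() yourself.
ensure_genomeinfodb <- function() {
  installed <- tryCatch(
    as.character(utils::packageVersion("GenomeInfoDb")),
    error = function(e) "0.0.0"  # treat "not installed" as an outdated version
  )
  if (needs_upgrade(installed)) {
    # BiocManager and remotes are both needed for a GitHub install.
    for (pkg in c("BiocManager", "remotes")) {
      if (!requireNamespace(pkg, quietly = TRUE)) install.packages(pkg)
    }
    # Assumption: "Bioconductor/GenomeInfoDb" is the GitHub repository to install from.
    BiocManager::install("Bioconductor/GenomeInfoDb")
  }
  # Returns (invisibly) the version that was found before any upgrade.
  invisible(installed)
}
```

After calling ensure_genomeinfodb() and restarting R, Seqinfo(genome="pea_aphid_22Mar2018_4r6ur") should no longer complain that the assembly is not registered. Installing the devel version into a release R may pull in mismatched dependencies, so waiting for the BioC 3.15 build is the safer option if there is no rush.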