jorainer / ensembldb

This is the ensembldb development repository.
https://jorainer.github.io/ensembldb

org.Ss.eg.db or Atlantic salmon database? #144

Open marwa38 opened 1 year ago

marwa38 commented 1 year ago

Could you please let me know whether you might create an Ensembl database for Atlantic salmon (Salmo salar), like org.Hs.eg.db for human? I found org.Ss.eg.db, but that one is for pig. Thanks, Marwa

jorainer commented 1 year ago

Note that annotation databases for Salmo salar are already available in Bioconductor's AnnotationHub:

> library(AnnotationHub)
Loading required package: BiocGenerics

Attaching package: ‘BiocGenerics’

The following objects are masked from ‘package:stats’:

    IQR, mad, sd, var, xtabs

The following objects are masked from ‘package:base’:

    anyDuplicated, aperm, append, as.data.frame, basename, cbind,
    colnames, dirname, do.call, duplicated, eval, evalq, Filter, Find,
    get, grep, grepl, intersect, is.unsorted, lapply, Map, mapply,
    match, mget, order, paste, pmax, pmax.int, pmin, pmin.int,
    Position, rank, rbind, Reduce, rownames, sapply, setdiff, sort,
    table, tapply, union, unique, unsplit, which.max, which.min

Loading required package: BiocFileCache
Loading required package: dbplyr
> ah <- AnnotationHub()
snapshotDate(): 2022-10-26
> query(ah, "Salmo salar")
AnnotationHub with 73 records
# snapshotDate(): 2022-10-26
# $dataprovider: Ensembl, FANTOM5,DLRP,IUPHAR,HPRD,STRING,SWISSPROT,TREMBL,E...
# $species: salmo salar, Salmo salar
# $rdataclass: TwoBitFile, GRanges, EnsDb, SQLiteFile, OrgDb
# additional mcols(): taxonomyid, genome, description,
#   coordinate_1_based, maintainer, rdatadateadded, preparerclass, tags,
#   rdatapath, sourceurl, sourcetype 
# retrieve records with, e.g., 'object[["AH78891"]]' 

             title                                           
  AH78891  | Ensembl 99 EnsDb for Salmo salar                
  AH79444  | Salmo_salar.ICSASG_v2.99.abinitio.gtf           
  ...        ...                                             
  AH107261 | LRBaseDb for Salmo salar (Atlantic salmon, v004)
  AH107424 | org.Salmo_salar.eg.sqlite                       

Or, if you are specifically interested in an EnsDb database:

> query(ah, c("EnsDb", "Salmo salar"))
AnnotationHub with 9 records
# snapshotDate(): 2022-10-26
# $dataprovider: Ensembl
# $species: Salmo salar
# $rdataclass: EnsDb
# additional mcols(): taxonomyid, genome, description,
#   coordinate_1_based, maintainer, rdatadateadded, preparerclass, tags,
#   rdatapath, sourceurl, sourcetype 
# retrieve records with, e.g., 'object[["AH78891"]]' 

             title                            
  AH78891  | Ensembl 99 EnsDb for Salmo salar 
  AH79796  | Ensembl 100 EnsDb for Salmo salar
  ...        ...                              
  AH100766 | Ensembl 106 EnsDb for Salmo salar
  AH104987 | Ensembl 107 EnsDb for Salmo salar

You can then retrieve one of them, e.g. the one for Ensembl 107, using:

> edb <- ah[["AH104987"]]
downloading 1 resources
retrieving 1 resource
  |======================================================================| 100%

loading from cache
require(“ensembldb”)
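
Once loaded, the EnsDb can be queried with the usual ensembldb accessors. The following is only a minimal sketch, assuming the `edb` object retrieved above; the `"protein_coding"` biotype is used purely as an illustration and is assumed to be present in this database:

```r
library(ensembldb)

## Assumes `edb` was retrieved as above: edb <- ah[["AH104987"]]
## Basic information on the database.
organism(edb)
ensemblVersion(edb)

## List the first few sequence (chromosome/scaffold) names.
head(seqlevels(edb))

## Extract all genes as a GRanges object.
gns <- genes(edb)
head(gns)

## Restrict to one gene biotype using a filter
## (assumption: "protein_coding" is one of the biotypes in this EnsDb).
pc <- genes(edb, filter = GeneBiotypeFilter("protein_coding"))
length(pc)
```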

If, however, you are looking for the org database (OrgDb), you can retrieve the one listed further above (AH107424):

> db <- ah[["AH107424"]]
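
The resulting OrgDb can be queried through the standard AnnotationDbi interface. This is a rough sketch, assuming the `db` object from above; which columns and key types are actually available depends on the record, so `"ENTREZID"` and `"SYMBOL"` are assumptions that should be checked against the output of `keytypes(db)` and `columns(db)` first:

```r
library(AnnotationDbi)

## Assumes `db` was retrieved as above: db <- ah[["AH107424"]]
## Inspect which annotation columns and key types this OrgDb provides.
columns(db)
keytypes(db)

## Take a few example keys
## (assumption: "ENTREZID" is listed in keytypes(db)).
ks <- head(keys(db, keytype = "ENTREZID"))

## Map the keys to gene symbols
## (assumption: "SYMBOL" is listed in columns(db)).
select(db, keys = ks, columns = "SYMBOL", keytype = "ENTREZID")
```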