Creating annotation file tx2gene for NCBI human transcriptome

hbctraining / DGE_workshop_salmon_online

163 stars 75 forks source link

Hi @ellalalalalalala

I think for NCBI annotations you might better off using OrgDb. It will usually be the most current build, so using this will get you hg38 which it looks like you want? You will see that the data is current from Sept 2021.

query(ah, c("Homo sapiens", "OrgDb"))

AnnotationHub with 1 record
# snapshotDate(): 2021-10-20
# names(): AH95959
# $dataprovider: ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/
# $species: Homo sapiens
# $rdataclass: OrgDb
# $rdatadateadded: 2021-10-08
# $title: org.Hs.eg.db.sqlite
# $description: NCBI gene ID based annotations about Homo sapiens
# $taxonomyid: 9606
# $genome: NCBI genomes
# $sourcetype: NCBI/ensembl
# $sourceurl: ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/, ftp://ftp.ensembl.org/pub/current_fasta
# $sourcesize: NA
# $tags: c("NCBI", "Gene", "Annotation") 
# retrieve record with 'object[["AH95959"]]'

Hope this helps!

hbctraining / DGE_workshop_salmon_online

Creating annotation file tx2gene for NCBI human transcriptome #30