Hi,
Thanks so much for making such amazing software.
I have some problems when I was using my de novo assembly transcriptome with ENSEMBL reference of the species other than human.
In the file Ularcirc/inst/shiny-app/circRNA/Server.R on the line 269 and 1338, the regular expression, ^ENSG([-0-9]+) and ^ENSG([-0-9]+), is only validate for Human case.
The Ensembl ID format is like ENS\<species prefix>\<feature type prefix>\<a unique eleven digit number>. For example, ENSMUSG00000017167 is a ENSEMBL stable ID of a mouse gene.
Therefore, I changed the original Ensembl ID test to the follows:
for the line 269:
test0 <- gsubfn::strapplyc(as.character(GeneName),pattern="^(ENS[[:alpha:]]*).*")
test <- gsubfn::strapplyc(as.character(GeneName),pattern=paste0("^",test0[[1]],"([-0-9]+)"))
if (length(test[[1]]) > 0) # Ensembl ID
Also, when I was tring to use my custome BSgenome and TxDb packages based on the de novo assembly transcriptome with ENSEMBL reference, I got the similar errors. Because the original function “Gene_Transcript_Features” is based on ENTREZID but my circexplorer2 is based on ENSEMBL ID.
Therefore, I added a ENSEMBL ID test in the “Gene_Transcript_Features” function as below:
for the line 450:
test0 <- gsubfn::strapplyc(as.character(Gene_Symbol),pattern="^(ENS[[:alpha:]]*).*")
test <- gsubfn::strapplyc(as.character(Gene_Symbol),pattern=paste0("^",test0[[1]],"([-0-9]+)"))
if (length(test[[1]]) > 0) # Ensembl ID
{
ensembl_gene <- paste(test0[[1]],test[[1]],sep="")
a <- select(GeneList$Annotation_Library, keys = Gene_Symbol, columns=c("ENTREZID", "SYMBOL", "ENSEMBL"),keytype="ENSEMBL")
if("EXONRANK"%in%keytypes(GeneList$transcript_reference)){
b <- select(GeneList$transcript_reference, keys = a$ENSEMBL, columns=c('GENEID', 'TXNAME'),keytype="GENEID")
}else{
b <- select(GeneList$transcript_reference, keys = a$ENSEMBL, columns=c('GENEID', 'TXNAME', 'EXONRANK'),keytype="GENEID")
}
}
else
{
a <- select(GeneList$Annotation_Library, keys = Gene_Symbol, columns=c("ENTREZID", "SYMBOL"),keytype="SYMBOL")
if("EXONRANK"%in%keytypes(GeneList$transcript_reference)){
b <- select(GeneList$transcript_reference, keys = a$ENTREZID, columns=c('GENEID', 'TXNAME'),keytype="GENEID")
}else{
b <- select(GeneList$transcript_reference, keys = a$ENTREZID, columns=c('GENEID', 'TXNAME', 'EXONRANK'),keytype="GENEID")
}
}
I forked this project and made the above changes in that branch. Could you please check my modifications?
Hi, Thanks so much for making such amazing software.
I have some problems when I was using my de novo assembly transcriptome with ENSEMBL reference of the species other than human.
In the file Ularcirc/inst/shiny-app/circRNA/Server.R on the line 269 and 1338, the regular expression, ^ENSG([-0-9]+) and ^ENSG([-0-9]+), is only validate for Human case.
The Ensembl ID format is like ENS\<species prefix>\<feature type prefix>\<a unique eleven digit number>. For example, ENSMUSG00000017167 is a ENSEMBL stable ID of a mouse gene.
Therefore, I changed the original Ensembl ID test to the follows:
for the line 269:
for the line 1338:
Also, when I was tring to use my custome BSgenome and TxDb packages based on the de novo assembly transcriptome with ENSEMBL reference, I got the similar errors. Because the original function “Gene_Transcript_Features” is based on ENTREZID but my circexplorer2 is based on ENSEMBL ID.
Therefore, I added a ENSEMBL ID test in the “Gene_Transcript_Features” function as below:
for the line 450:
I forked this project and made the above changes in that branch. Could you please check my modifications?
https://github.com/wong-ziyi/Ularcirc
Thanks.