Closed parvathisudha closed 3 years ago
I have a similar problem. Attached are:
Any idea what went wrong?
Is there any output to the terminal when salmon is running that would suggest it couldn't interpret the GTF properly? Can you share the salmon log file?
Any idea what went wrong?
I had that issue while using Salmon version 0.9.1. Once I upgraded the salmon version, I could index the genome properly and I obtained gene-level TPM for the samples.
This is the initial output log, where it reports an inccorrect gene annotation:
Error: invalid feature coordinates (end<start!) at line: NC_029855.1 RefSeq gene 406748 107842 . + . gene_id "A5N79_gp28"; db_xref "GeneID:27215502"; exception "trans-splicing"; gbkey "Gene"; gene "nad2"; gene_biotype "protein_coding"; locus_tag "A5N79_gp28";
After I remove the erroneous entry, there is no more complaint:
However, the *.sf files are the same as previous ones, i.e. no gene level results.
Interestingly, the same gtf file can be used to obtain gene level counts using HTSeq-count in OmicsBox, suggesting that the file is working fine.
I have used tximport to complete the job instead.
Thanks for letting us know. Tximport is the recommended way to accomplish this.
Hi Salmon team,
Thanks for the tool! I am trying to create the salmon index for GRCh38 using Gencode. When I did the quantification, even though I added GTF file, "quant.genes.sf is only showing the Transcript id's not gene id's. Can you please tell me how to solve this issue? Is there a way to create a GTF based salmon index file for GRCh38?
Thanks Parvtahi.