nanoporetech / pipeline-transcriptome-de

Pipeline for differential gene expression (DGE) and differential transcript usage (DTU) analysis using long reads
Other
106 stars 27 forks source link

Error in rule de_analysis: jobid: 9 #20

Closed MustafaElshani closed 3 years ago

MustafaElshani commented 4 years ago

No matter how hard I try I can't seem to get over this error

Loading counts, conditions and parameters.
Loading annotation database.
Import genomic features from the file as a GRanges object ... OK
Prepare the 'metadata' data frame ... OK
Make the TxDb object ... OK
Warning message:
In .get_cds_IDX(mcols0$type, mcols0$phase) :
  The "phase" metadata column contains non-NA values for features of type
  stop_codon. This information was ignored.
'select()' returned 1:many mapping between keys and columns
Error in rule de_analysis:
    jobid: 9
    output: de_analysis/results_dge.tsv, de_analysis/results_dge.pdf, de_analysis/results_dtu_gene.tsv, de_analysis/results_dtu_transcript.tsv, de_analysis/results_dtu_stageR.tsv, merged/all_counts_filtered.tsv, merged/all_gene_counts.tsv
    conda-env: /home/mustafa/pipeline-transcriptome-de/Workspaces/ExperimentPipelines/.snakemake/conda/fe062354
    shell:

    /home/mustafa/pipeline-transcriptome-de/scripts/de_analysis.R

        (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)

Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Complete log: /home/mustafa/pipeline-transcriptome-de/.snakemake/log/2020-08-26T192935.491547.snakemake.log
MustafaElshani commented 4 years ago

This issue seems to arise from using Gencode Reference files after switching to ENSMBL it worked

MustafaElshani commented 4 years ago

This issue seems to arise from using Gencode Reference files after switching to ENSMBL it worked

If using GENCODE reference add --gencode at the salmon quant line on Snakefile, It seperates the | into columns

callumparr commented 3 years ago

What version of salmon are you using? Did you update or downgrade from the one installed in the conda environment? I tried adding --gencode flag to the Snakefile and can it is output to the log when it runs but I still get this error when running the pipeline with gencode transcriptome file.

callumparr commented 3 years ago

Update: This does not work. And there are clear reasons why it should not work. --gencode flag only works when using the other mode of salmon, that is first making an index and doing the quasi-mapping. For quant mode --gencode does not work. you will need to manually run salmon on your samples mapped to gencode annotations or you can follow this issue perhaps where they use no standard annotation files and alter the snakefile to handed the different format of gtf.

https://github.com/nanoporetech/pipeline-transcriptome-de/issues/18#issuecomment-674001168