GoekeLab / xpore

Identification of differential RNA modifications from nanopore direct RNA sequencing
https://xpore.readthedocs.io/
MIT License
131 stars 23 forks source link

empty output files in dataprep stage #214

Open villegmb opened 3 months ago

villegmb commented 3 months ago

Hello. I am having problems with the output file from the dataprep stage. This is the code I am running:

xpore dataprep \
--eventalign /ibex/tmp/c2078/Heat_stress_analysis/xpore/data/25rep1/eventalign.txt \
--out_dir /ibex/tmp/c2078/Heat_stress_analysis/xpore/data/25rep1/dataprep \
--gtf_or_gff /ibex/tmp/c2078/Heat_stress_analysis/xpore/Ahem_transcriptome_with_reference.gtf \
--transcript_fasta /ibex/tmp/c2078/Heat_stress_analysis/scripts/transcripts_util_names_edited.fasta \
--genome \
--n_processes 24

This is my eventalign.txt file:

image

This is my gtf: image

My Fasta: image

And my dataprep directory looks like this: image

Empty: image

data.json: empty

image

image

image

This is from the running process:

image

image

Can you please tell me what am I doing wrong?

yuukiiwa commented 3 months ago

Hi @villegmb,

Can you try removing the sample column in your eventalign.txt file and run xpore dataprep again, please?

Thanks!

Best wishes, Yuk Kei

villegmb commented 3 months ago

Hi @villegmb,

Can you try removing the sample column in your eventalign.txt file and run xpore dataprep again, please?

Thanks!

Best wishes, Yuk Kei

Hello, I did it, but the outcome did not change. I got exactly the same.

yuukiiwa commented 3 months ago

Hi @villegmb,

I suspect that this has to do with how the gene_id (STRG.8) and transcript_id (STRG.446.1) look like in your fasta and gtf files. Can you try changing the gene_id and transcriptid to be separated with `instead of.in your fasta, gtf, and eventalign.txt files as a.is possibly recognized astranscript_versionbyxpore dataprep`.

Thanks!

Best wishes, Yuk Kei