Ibvt / RNAnue

RNAnue
GNU General Public License v3.0
8 stars 4 forks source link

Running RNAnue from container #8

Closed Egors01 closed 4 months ago

Egors01 commented 3 years ago

Hello,

Thank you for creating the pipeline.

I am encountering problems when trying to run the RNAnue from the docker container I have PE reads and use the config file created from the template provided here. The file is attached

When I run: _singularity exec rnanue_latest.sif /RNAnue/build/RNAnue subcall -c /data/meyer/egor/RNAnuepipeline/params.cfg

I got the following output whtout downstream errors/warnings or other output.

    after parameters
    RNAnue v0.1.0 - Detect RNA-RNA interactions from Direct-Duplex-Detection (DDD) data.
    create Base
    "Ja gut aehh..." - Randy OR  <some random citation>

Here is some details on my configuration:

In the config file: _trtms = /data/egor/RNAnuepipeline/paired/trtms


I would greately appreciate any help

[params.cfg.txt](https://github.com/Ibvt/RNAnue/files/6892403/params.cfg.txt)
riasc commented 3 years ago

Hi @Egors01, thanks again for the issue. The problem seems to be occurring on a specific set of identifiers in the .fastq files. I pushed a new image that should resolve that. Also the ending of the files is .fastq? (not .fq). Haven't captured this before.

I'm currently testing this on a singularity instance. Other than that your setup seems fine. Also, it seems the container was behind a few commits such that the parameters files couldn't be specified correctly. Sorry about that. Could you please try that with the recent image? I tried to make the description a bit more detailed and will try to describe the individual steps in more detail these days.. In principle, each subcall can be executed separately... preproc, align, detect, analysis..

You did execute it with the "complete" subcall ( as "complete") right? I think I haven't captured a non-specified subcall. Which I maybe should do.

Rich

params.cfg-2.txt

Egors01 commented 3 years ago

Hi, Rich

Thanks a lot. I am going to try with the new container build soon.

The reads are with the .fastq extension. I will try with .fq as well. I am not sure I understand the "complete" definition. From container I was running /RNAnue/build/RNAnue subcall -c which I wanted to make the run of all steps.

Best Egor

riasc commented 3 years ago

Hi Egor, ah yes I think I didn't put the subcall in "<>", thats why I guess it was a bit confusing. You would have to execute /RNAnue/build/RNAnue complete -c or just replace subcall with the individual steps.. preproc, align etc.. Haven't put that in the README.. Thanks.

Egors01 commented 3 years ago

Hi, Rich

I have tried to run the container build again, and it looks unusable in its current state. Below I provide the list of the problems that I have encountered.

  1. RNAnue complete execution fails with an error. Call:

singularity exec --bind /data/egor:/mnt ./rnanue_latest.sif /RNAnue/build/RNAnue complete -c /mnt/influenza_splash/2cimpl/RNAnue_pipeline/params_clean_container

Output:

RNAnue v0.1.0 - Detect RNA-RNA interactions from Direct-Duplex-Detection (DDD) data. complete analysis created instance of sequence RickShaw create lookup table using /mnt/influenza_splash/2cimpl/paired/r1r2-adapters.fa AGATCGGAAGAGCACACGTCTGAACTCCAGTCA AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output" already exists. created "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/complete" terminate called after throwing an instance of 'boost::wrapexcept' what(): No such node (complete) Aborted

  1. Container apparently does not contain/cannot find segemehl, so it is impossible to run any step other than "preproc". Error logs on RANue align:

$ singularity exec --bind /data/egor:/mnt ./rnanue_latest.sif /RNAnue/build/RNAnue align -c /mnt/influenza_splash/2cimpl/RNAnue_pipeline/params_clean_container RNAnue v0.1.0 - Detect RNA-RNA interactions from Direct-Duplex-Detection (DDD) data. reads the data to align *** collect the data ctrls: "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/preproc/ctrls" ...has been found in the filesystem! trtms: "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/preproc/trtms" ...has been found in the filesystem! wsn1_a retrieve output align called within Base create align object segemehl index found on filesystem "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output" already exists. "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align" already exists. "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align/ctrls" already exists. "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align/trtms" already exists. "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align/trtms/wsn1_a" already exists. start align sh: 1: segemehl.x: not found segemehl.x -S -A 90 -U 15 -W 80 -Z 15 -t 4 -m 20 -i /mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/GCA_000001405.15_GRCh38_no_alt_analysis_set.idx -d /mnt/influenza_splash/genome/hg38_chr/GCA_000001405.15_GRCh38_no_alt_analysis_set.fna -q /mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/preproc/trtms/wsn1_a/wsn1_a_preproc.fastq -o /mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align/trtms/wsn1_a/wsn1_a_preproc_matched.sam "Neeeeinnnnnn" - Biggi

  1. The similar error on RANnue detect. The container cannot find ViennaRNA.
    I have bypassed the upstream RANue align error with the manually compiled RNAnue build, so the following call takes the output of RNAnue aligns.

$ singularity exec --bind /data/egor:/mnt ./rnanue_latest.sif /RNAnue/build/RNAnue clustering -c /mnt/influenza_splash/2cimpl/RNAnue_pipeline/params_clean_container Output: RNAnue v0.1.0 - Detect RNA-RNA interactions from Direct-Duplex-Detection (DDD) data. *** collect the data ### WARNING - this call runs RNAnue without control data ### WARNING - "--ctrls" has not been set trtms: "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/align/trtms" ...has been found in the filesystem! wsn1_a "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output" already exists. created "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/detect" created "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/detect/trtms" created "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/detect/trtms/wsn1_a" sh: 1: RNAcofold: not found Segmentation fault

  1. When there is no control data, RNAnue align and detect steps also crash with the error if the folder "//ctrls" is not created manually. is the output folder that is specifid in the config file. is align, preproc etc. Call > $ singularity exec --bind /data/egor:/mnt ./rnanue_latest.sif /RNAnue/build/RNAnue align -c /mnt/influenza_splash/2cimpl/RNAnue_pipeline/params_clean_container Output > RNAnue v0.1.0 - Detect RNA-RNA interactions from Direct-Duplex-Detection (DDD) data. > reads the data to align > *** collect the data > ctrls: "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/preproc/ctrls" > \### ERROR - "/mnt/influenza_splash/2cimpl/RNAnue_pipeline/rnanue-output/preproc/ctrls"could not be found in the filesystem!

The config file that I use is attached params_clean_container.txt