Closed shanmugavadivelps closed 1 year ago
Hi, can you post here the commands and error message?
Anyway, if you use Docker, you can use the Docker image available from dockerhub:
docker pull mazzalab/fastqwiper
It will include both fastqwiper and the Snakemake workflows for complete wiping. Once downloaded the image, you can type:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper paired 8 sample
where:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper single 8 excerpt_R1_001
Thanks for your reply.
Sir/Madam,
I am using as below
fastqwiper --fastq_in SRR13556219_2.fastq.gz --fastq_out 22.fastq.gz
Start wiping SRR13556219_2.fastq.gz
2023-08-24 09:53:53,391 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
Traceback (most recent call last):
File "/home/microbiology/anaconda3/envs/FastqWiper/bin/fastqwiper", line
11, in
I am not using docker and I am not a pure bioinformatician. Here I have used a single file to solve the read issue. Pls help me to solve the issue
On Thu, Aug 24, 2023 at 12:14 AM MazzaLab @.***> wrote:
Hi, can you post here the commands and error message?
Anyway, if you use Docker, you can use the Docker image available from dockerhub:
docker pull mazzalab/fastqwiper
It will include both fastqwiper and the Snakemake workflows for complete wiping. Once downloaded the image, you can type:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper paired 8 sample
where:
- YOUR_STATIC_PATH_TO_DATA_FOLDER should be the path to the folder where you have the fastq.gz files to be wiped
- "paired" trigger the cleaning of R1 and R2 while "single" that of individual files
- "8" is the number of cores to be spawned;
- "sample" is part of the names of the files to be wiped. In this regard, remember that: 4.1 - for paired-end files (e.g., "sample_R1.fastq.gz" and "sample_R2.fastq.gz"), your files must finish with "_R1.fastq.gz" and "_R2.fastq.gz". The text to pass is everything before this text, "sample" in this case. 4.2 - for single end files (e.g., "excerpt_R1_001.fastq.gz"), your file must ends with the string ".fastq.gz"; all the preceding text, i.e., "excerpt_R1_001" will be the text to be passed to the command above:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper single 8 excerpt_R1_001
— Reply to this email directly, view it on GitHub https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1690459603, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN2V7HUUU2V5RPLBXX4ZXATXWZFQJANCNFSM6AAAAAA33VR53M . You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
OK, FastWiper alone (i.e., without Snakemake pipelines) is not able to read corrupted files. Yours seems to be corrupted.
If you cannot install Snakemake, please send me your file, I'll clean it for you.
best wishes,
tom
-
Tommaso Mazza, Eng. Ph.D.
Laboratory of Bioinformatics, PI
Fondazione IRCCS Casa Sollievo della Sofferenza
Viale Regina Margherita 261 - 00198 Roma IT
Tel: +39 06 44160526 - Fax: +39 06 44160548
E-mail: @.***
Web page: http://www.css-mendel.it/
Web page: http://bioinformatics.css-mendel.it
From: shanmugavadivelps @.***> Sent: Thursday, August 24, 2023 7:26 AM To: mazzalab/fastqwiper Cc: Tommaso Mazza; Comment Subject: Re: [mazzalab/fastqwiper] unable to install mamba using conda as you suggested (Issue #5)
Thanks for your reply.
Sir/Madam,
I am using as below
fastqwiper --fastq_in SRR13556219_2.fastq.gz --fastq_out 22.fastq.gz
Start wiping SRR13556219_2.fastq.gz
2023-08-24 09:53:53,391 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
500000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned
1000000 reads
Traceback (most recent call last):
File "/home/microbiology/anaconda3/envs/FastqWiper/bin/fastqwiper", line
11, in
I am not using docker and I am not a pure bioinformatician. Here I have used a single file to solve the read issue. Pls help me to solve the issue
On Thu, Aug 24, 2023 at 12:14 AM MazzaLab @.***> wrote:
Hi, can you post here the commands and error message?
Anyway, if you use Docker, you can use the Docker image available from dockerhub:
docker pull mazzalab/fastqwiper
It will include both fastqwiper and the Snakemake workflows for complete wiping. Once downloaded the image, you can type:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper paired 8 sample
where:
- YOUR_STATIC_PATH_TO_DATA_FOLDER should be the path to the folder where you have the fastq.gz files to be wiped
- "paired" trigger the cleaning of R1 and R2 while "single" that of individual files
- "8" is the number of cores to be spawned;
- "sample" is part of the names of the files to be wiped. In this regard, remember that: 4.1 - for paired-end files (e.g., "sample_R1.fastq.gz" and "sample_R2.fastq.gz"), your files must finish with "_R1.fastq.gz" and "_R2.fastq.gz". The text to pass is everything before this text, "sample" in this case. 4.2 - for single end files (e.g., "excerpt_R1_001.fastq.gz"), your file must ends with the string ".fastq.gz"; all the preceding text, i.e., "excerpt_R1_001" will be the text to be passed to the command above:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper single 8 excerpt_R1_001
— Reply to this email directly, view it on GitHub https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1690459603, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN2V7HUUU2V5RPLBXX4ZXATXWZFQJANCNFSM6AAAAAA33VR53M . You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
— Reply to this email directly, view it on GitHubhttps://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1691025013, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ACMMOGOTQJUE6FQNLFFKAFDXW3QZZANCNFSM6AAAAAA33VR53M. You are receiving this because you commented.Message ID: @.***>
i have installed now and executed as below py38) @.***:~ $ snakemake -s /home/microbiology/fastqwiper/pipeline/fix_wipe_pairs_reads.smk --use-conda --cores 2 Building DAG of jobs... Your conda installation is not configured to use strict channel priorities. This is however crucial for having robust and correct environments (for details, see https://conda-forge.org/docs/user/tipsandtricks.html). Please consider to configure strict priorities by executing 'conda config --set channel_priority strict'. Using shell: /usr/bin/bash Provided cores: 2 Rules claiming more threads will be scaled down. Job stats: job count min threads max threads
all 1 1 1 drop_unpaired 1 2 2 fix_interleaving 1 1 1 wipe_fastq 2 1 1 total 5 1 2
Select jobs to execute...
[Thu Aug 24 13:36:26 2023] rule wipe_fastq: input: data/sample_S1_R1_001_fixed.fastq output: data/sample_S1_R1_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log jobid: 3 reason: Missing output files: data/sample_S1_R1_001_fixed_wiped.fastq.gz wildcards: sample=sample_S1_R1_001 resources: tmpdir=/tmp
[Thu Aug 24 13:36:26 2023] rule wipe_fastq: input: data/sample_S1_R2_001_fixed.fastq output: data/sample_S1_R2_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log jobid: 5 reason: Missing output files: data/sample_S1_R2_001_fixed_wiped.fastq.gz wildcards: sample=sample_S1_R2_001 resources: tmpdir=/tmp
[Thu Aug 24 13:36:27 2023] Error in rule wipe_fastq: jobid: 5 input: data/sample_S1_R2_001_fixed.fastq output: data/sample_S1_R2_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log (check log file(s) for error details) shell: python fastq_wiper/wiper.py --fastq_in data/sample_S1_R2_001_fixed.fastq --fastq_out data/sample_S1_R2_001_fixed_wiped.fastq.gz 2> logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
[Thu Aug 24 13:36:27 2023] Error in rule wipe_fastq: jobid: 3 input: data/sample_S1_R1_001_fixed.fastq output: data/sample_S1_R1_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log (check log file(s) for error details) shell: python fastq_wiper/wiper.py --fastq_in data/sample_S1_R1_001_fixed.fastq --fastq_out data/sample_S1_R1_001_fixed_wiped.fastq.gz 2> logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: .snakemake/log/2023-08-24T133623.085748.snakemake.log
On Thu, Aug 24, 2023 at 1:03 PM MazzaLab @.***> wrote:
OK, FastWiper alone (i.e., without Snakemake pipelines) is not able to read corrupted files. Yours seems to be corrupted.
If you cannot install Snakemake, please send me your file, I'll clean it for you.
best wishes,
tom
-
Tommaso Mazza, Eng. Ph.D.
Laboratory of Bioinformatics, PI
Fondazione IRCCS Casa Sollievo della Sofferenza
Viale Regina Margherita 261 - 00198 Roma IT
Tel: +39 06 44160526 - Fax: +39 06 44160548
E-mail: @.***
Web page: http://www.css-mendel.it/
Web page: http://bioinformatics.css-mendel.it
From: shanmugavadivelps @.***> Sent: Thursday, August 24, 2023 7:26 AM To: mazzalab/fastqwiper Cc: Tommaso Mazza; Comment Subject: Re: [mazzalab/fastqwiper] unable to install mamba using conda as you suggested (Issue #5)
Thanks for your reply.
Sir/Madam,
I am using as below
fastqwiper --fastq_in SRR13556219_2.fastq.gz --fastq_out 22.fastq.gz Start wiping SRR13556219_2.fastq.gz 2023-08-24 09:53:53,391 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads Traceback (most recent call last): File "/home/microbiology/anaconda3/envs/FastqWiper/bin/fastqwiper", line 11, in
sys.exit(wipe_fastq()) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py", line 829, in call return self.main(*args, **kwargs) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 782, in main rv = self.invoke(ctx) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 1066, in invoke return ctx.invoke(self.callback, **ctx.params) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 610, in invoke return callback(*args, **kwargs) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/fastq_wiper/wiper.py",
line 88, in wipe_fastq for line in fin: File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 305, in read1 return self._buffer.read1(size) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/_compression.py",
line 68, in readinto data = self.read(len(byte_view)) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 470, in read self._read_eof() File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 516, in _read_eof raise BadGzipFile("CRC check failed %s != %s" % (hex(crc32), gzip.BadGzipFile: CRC check failed 0x7f073a84 != 0x6287761b
I am not using docker and I am not a pure bioinformatician. Here I have used a single file to solve the read issue. Pls help me to solve the issue
On Thu, Aug 24, 2023 at 12:14 AM MazzaLab @.***> wrote:
Hi, can you post here the commands and error message?
Anyway, if you use Docker, you can use the Docker image available from dockerhub:
docker pull mazzalab/fastqwiper
It will include both fastqwiper and the Snakemake workflows for complete wiping. Once downloaded the image, you can type:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper paired 8 sample
where:
- YOUR_STATIC_PATH_TO_DATA_FOLDER should be the path to the folder where you have the fastq.gz files to be wiped
- "paired" trigger the cleaning of R1 and R2 while "single" that of individual files
- "8" is the number of cores to be spawned;
- "sample" is part of the names of the files to be wiped. In this regard, remember that: 4.1 - for paired-end files (e.g., "sample_R1.fastq.gz" and "sample_R2.fastq.gz"), your files must finish with "_R1.fastq.gz" and "_R2.fastq.gz". The text to pass is everything before this text, "sample" in this case. 4.2 - for single end files (e.g., "excerpt_R1_001.fastq.gz"), your file must ends with the string ".fastq.gz"; all the preceding text, i.e., "excerpt_R1_001" will be the text to be passed to the command above:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper single 8 excerpt_R1_001
— Reply to this email directly, view it on GitHub https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1690459603,
or unsubscribe < https://github.com/notifications/unsubscribe-auth/AN2V7HUUU2V5RPLBXX4ZXATXWZFQJANCNFSM6AAAAAA33VR53M>
. You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
— Reply to this email directly, view it on GitHub< https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1691025013>, or unsubscribe< https://github.com/notifications/unsubscribe-auth/ACMMOGOTQJUE6FQNLFFKAFDXW3QZZANCNFSM6AAAAAA33VR53M>.
You are receiving this because you commented.Message ID: @.***>
— Reply to this email directly, view it on GitHub https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1691160150, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN2V7HV7LWYYGFCL4GWSH23XW37UPANCNFSM6AAAAAA33VR53M . You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
i have modified .smk file and attached for your reference
On Thu, Aug 24, 2023 at 1:38 PM @. < @.> wrote:
i have installed now and executed as below py38) @.***:~ $ snakemake -s /home/microbiology/fastqwiper/pipeline/fix_wipe_pairs_reads.smk --use-conda --cores 2 Building DAG of jobs... Your conda installation is not configured to use strict channel priorities. This is however crucial for having robust and correct environments (for details, see https://conda-forge.org/docs/user/tipsandtricks.html). Please consider to configure strict priorities by executing 'conda config --set channel_priority strict'. Using shell: /usr/bin/bash Provided cores: 2 Rules claiming more threads will be scaled down. Job stats: job count min threads max threads
all 1 1 1 drop_unpaired 1 2 2 fix_interleaving 1 1 1 wipe_fastq 2 1 1 total 5 1 2
Select jobs to execute...
[Thu Aug 24 13:36:26 2023] rule wipe_fastq: input: data/sample_S1_R1_001_fixed.fastq output: data/sample_S1_R1_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log jobid: 3 reason: Missing output files: data/sample_S1_R1_001_fixed_wiped.fastq.gz wildcards: sample=sample_S1_R1_001 resources: tmpdir=/tmp
[Thu Aug 24 13:36:26 2023] rule wipe_fastq: input: data/sample_S1_R2_001_fixed.fastq output: data/sample_S1_R2_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log jobid: 5 reason: Missing output files: data/sample_S1_R2_001_fixed_wiped.fastq.gz wildcards: sample=sample_S1_R2_001 resources: tmpdir=/tmp
[Thu Aug 24 13:36:27 2023] Error in rule wipe_fastq: jobid: 5 input: data/sample_S1_R2_001_fixed.fastq output: data/sample_S1_R2_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log (check log file(s) for error details) shell: python fastq_wiper/wiper.py --fastq_in data/sample_S1_R2_001_fixed.fastq --fastq_out data/sample_S1_R2_001_fixed_wiped.fastq.gz 2> logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
[Thu Aug 24 13:36:27 2023] Error in rule wipe_fastq: jobid: 3 input: data/sample_S1_R1_001_fixed.fastq output: data/sample_S1_R1_001_fixed_wiped.fastq.gz log: logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log (check log file(s) for error details) shell: python fastq_wiper/wiper.py --fastq_in data/sample_S1_R1_001_fixed.fastq --fastq_out data/sample_S1_R1_001_fixed_wiped.fastq.gz 2> logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: .snakemake/log/2023-08-24T133623.085748.snakemake.log
On Thu, Aug 24, 2023 at 1:03 PM MazzaLab @.***> wrote:
OK, FastWiper alone (i.e., without Snakemake pipelines) is not able to read corrupted files. Yours seems to be corrupted.
If you cannot install Snakemake, please send me your file, I'll clean it for you.
best wishes,
tom
-
Tommaso Mazza, Eng. Ph.D.
Laboratory of Bioinformatics, PI
Fondazione IRCCS Casa Sollievo della Sofferenza
Viale Regina Margherita 261 - 00198 Roma IT
Tel: +39 06 44160526 - Fax: +39 06 44160548
E-mail: @.***
Web page: http://www.css-mendel.it/
Web page: http://bioinformatics.css-mendel.it
From: shanmugavadivelps @.***> Sent: Thursday, August 24, 2023 7:26 AM To: mazzalab/fastqwiper Cc: Tommaso Mazza; Comment Subject: Re: [mazzalab/fastqwiper] unable to install mamba using conda as you suggested (Issue #5)
Thanks for your reply.
Sir/Madam,
I am using as below
fastqwiper --fastq_in SRR13556219_2.fastq.gz --fastq_out 22.fastq.gz Start wiping SRR13556219_2.fastq.gz 2023-08-24 09:53:53,391 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:53:53,392 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 500000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads 2023-08-24 09:55:04,876 - fastq_wiper/wiper.py(wipe_fastq) - INFO - Cleaned 1000000 reads Traceback (most recent call last): File "/home/microbiology/anaconda3/envs/FastqWiper/bin/fastqwiper", line 11, in
sys.exit(wipe_fastq()) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py", line 829, in call return self.main(*args, **kwargs) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 782, in main rv = self.invoke(ctx) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 1066, in invoke return ctx.invoke(self.callback, **ctx.params) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/click/core.py",
line 610, in invoke return callback(*args, **kwargs) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/site-packages/fastq_wiper/wiper.py",
line 88, in wipe_fastq for line in fin: File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 305, in read1 return self._buffer.read1(size) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/_compression.py",
line 68, in readinto data = self.read(len(byte_view)) File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 470, in read self._read_eof() File "/home/microbiology/anaconda3/envs/FastqWiper/lib/python3.8/gzip.py", line 516, in _read_eof raise BadGzipFile("CRC check failed %s != %s" % (hex(crc32), gzip.BadGzipFile: CRC check failed 0x7f073a84 != 0x6287761b
I am not using docker and I am not a pure bioinformatician. Here I have used a single file to solve the read issue. Pls help me to solve the issue
On Thu, Aug 24, 2023 at 12:14 AM MazzaLab @.***> wrote:
Hi, can you post here the commands and error message?
Anyway, if you use Docker, you can use the Docker image available from dockerhub:
docker pull mazzalab/fastqwiper
It will include both fastqwiper and the Snakemake workflows for complete wiping. Once downloaded the image, you can type:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper paired 8 sample
where:
- YOUR_STATIC_PATH_TO_DATA_FOLDER should be the path to the folder where you have the fastq.gz files to be wiped
- "paired" trigger the cleaning of R1 and R2 while "single" that of individual files
- "8" is the number of cores to be spawned;
- "sample" is part of the names of the files to be wiped. In this regard, remember that: 4.1 - for paired-end files (e.g., "sample_R1.fastq.gz" and "sample_R2.fastq.gz"), your files must finish with "_R1.fastq.gz" and "_R2.fastq.gz". The text to pass is everything before this text, "sample" in this case. 4.2 - for single end files (e.g., "excerpt_R1_001.fastq.gz"), your file must ends with the string ".fastq.gz"; all the preceding text, i.e., "excerpt_R1_001" will be the text to be passed to the command above:
docker run --rm -ti --name fastqwiper -v "YOUR_STATIC_PATH_TO_DATA_FOLDER:/fastqwiper/data" mazzalab/fastqwiper single 8 excerpt_R1_001
— Reply to this email directly, view it on GitHub < https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1690459603>,
or unsubscribe < https://github.com/notifications/unsubscribe-auth/AN2V7HUUU2V5RPLBXX4ZXATXWZFQJANCNFSM6AAAAAA33VR53M>
. You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
— Reply to this email directly, view it on GitHub< https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1691025013>, or unsubscribe< https://github.com/notifications/unsubscribe-auth/ACMMOGOTQJUE6FQNLFFKAFDXW3QZZANCNFSM6AAAAAA33VR53M>.
You are receiving this because you commented.Message ID: @.***>
— Reply to this email directly, view it on GitHub https://github.com/mazzalab/fastqwiper/issues/5#issuecomment-1691160150, or unsubscribe https://github.com/notifications/unsubscribe-auth/AN2V7HV7LWYYGFCL4GWSH23XW37UPANCNFSM6AAAAAA33VR53M . You are receiving this because you authored the thread.Message ID: @.***>
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
-- Shanmugavadivel, P. S. Scientist (Agricultural Biotechnology),
#216, Block A, ICAR-Indian Institute of Pulses Research,
Min. of Agriculture & Farmers Welfare,
Govt. of India,Kanpur - 208 024. *email: @. @.> https://iipr.icar.gov.in/ https://iipr.icar.gov.in/*
I cannot see the attached file Snakemake file. Please attach it through the GitHub web interface.
Can you also send me these files? logs/wipe_fastq/wipe_fastq.sample_S1_R2_001.log logs/wipe_fastq/wipe_fastq.sample_S1_R1_001.log
[Uploading fix_wipe_pairs_reads.smk…]()
wipe_fastq.sample_S1_R1_001.log wipe_fastq.sample_S1_R2_001.log
OK it seems it does not find the fastqwiper executable. Can you edit the "shell command " of tje Snakemake workflow here:
into:
shell:
"fastqwiper --fastq_in {input} --fastq_out {output} 2> {log}"
you need to execute the workflow within the cloned repository; where you have executed:
git clone https://github.com/mazzalab/fastqwiper.git
i have done as you said.
still i am encountering a failure
Error in rule fix_interleaving: jobid: 1 input: data/sample_S1_R1_001_fixed_wiped_paired.fastq.gz, data/sample_S1_R2_001_fixed_wiped_paired.fastq.gz output: data/sample_S1_R1_001_fixed_wiped_paired_interleaving.fastq.gz, data/sample_S1_R2_001_fixed_wiped_paired_interleaving.fastq.gz log: logs/pairing/pairing.sample.log (check log file(s) for error details) shell: bbmap/repair.sh in=data/sample_S1_R1_001_fixed_wiped_paired.fastq.gz in2=data/sample_S1_R2_001_fixed_wiped_paired.fastq.gz out=data/sample_S1_R1_001_fixed_wiped_paired_interleaving.fastq.gz out2=data/sample_S1_R2_001_fixed_wiped_paired_interleaving.fastq.gz outsingle=singletons.fastq.gz 2> logs/pairing/pairing.sample.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
[Thu Aug 24 15:03:33 2023] Error in rule fix_interleaving: jobid: 7 input: data/excerpt_S1_R1_001_fixed_wiped_paired.fastq.gz, data/excerpt_S1_R2_001_fixed_wiped_paired.fastq.gz output: data/excerpt_S1_R1_001_fixed_wiped_paired_interleaving.fastq.gz, data/excerpt_S1_R2_001_fixed_wiped_paired_interleaving.fastq.gz log: logs/pairing/pairing.excerpt.log (check log file(s) for error details) shell: bbmap/repair.sh in=data/excerpt_S1_R1_001_fixed_wiped_paired.fastq.gz in2=data/excerpt_S1_R2_001_fixed_wiped_paired.fastq.gz out=data/excerpt_S1_R1_001_fixed_wiped_paired_interleaving.fastq.gz out2=data/excerpt_S1_R2_001_fixed_wiped_paired_interleaving.fastq.gz outsingle=singletons.fastq.gz 2> logs/pairing/pairing.excerpt.log (one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Shutting down, this might take some time. Exiting because a job execution failed. Look above for error message Complete log: .snakemake/log/2023-08-24T150326.687136.snakemake.log
also attaching log files wipe_fastq.excerpt_S1_R1_001.log wipe_fastq.excerpt_S1_R2_001.log wipe_fastq.sample_S1_R1_001.log wipe_fastq.sample_S1_R2_001.log pairing.excerpt.log pairing.sample.log fix_gzrt.excerpt_S1_R1_001.log fix_gzrt.excerpt_S1_R2_001.log fix_gzrt.sample_S1_R1_001.log fix_gzrt.sample_S1_R2_001.log
Now its working fine. I have downloaded BBmap again and its working fine.
great!
Anyway, I'll make and share a video-tutorial to explain installation and usage soon.
Thanks for your support sir. Really happy to useSent from my iPhoneOn 24-Aug-2023, at 4:39 PM, MazzaLab @.***> wrote: great! Anyway, I'll make and share a video-tutorial to explain installation and usage soon.
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you modified the open/close state.Message ID: @.***>
I am using ubuntu 22.04 LTS and using conda, I am trying to install MAMBA and snakemake.
But I am unable to install MAMBA.
is it work with python 2.7 environment?.
pls help me to solve the issue.