sallesmath closed this issue 2 years ago
This still seems like an issue with regular expressions (because your names are very long and somewhat non-standard). You should have 2 options:
RAPiD-Genomics_F176_FUP_141801_P002_WB08_i5-535_i7-57_S327:amcc204510
RAPiD-Genomics_F176_FUP_141801_P002_WC09_i5-535_i7-93_S340:mtr29622
This should work with the existing R1 and R2 patterns.
--r1-pattern "{}_(?:.*)_(?:.*)_(?:.*)_(?:.*)_(R1|READ1|Read1|read1)_\\d+.fastq(?:.gz)*"
--r2-pattern "{}_(?:.*)_(?:.*)_(?:.*)_(?:.*)_(R2|READ2|Read2|read2)_\\d+.fastq(?:.gz)*"
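To check a pattern like this before running illumiprocessor, it can help to test it against a file name in plain Python. The snippet below is only a rough sketch: the sample prefix and file names are made up for illustration, and it assumes the `{}` placeholder is replaced by the sample name and the result is matched from the start of the file name, which may not be exactly what illumiprocessor does internally.

```python
import re

# Pattern copied from the --r1-pattern suggestion above; "{}" is the
# placeholder that gets replaced by the sample prefix.
R1_PATTERN = r"{}_(?:.*)_(?:.*)_(?:.*)_(?:.*)_(R1|READ1|Read1|read1)_\d+.fastq(?:.gz)*"


def read_label(pattern, prefix, filename):
    """Return the captured read label (e.g. "R1"), or None if there is no match."""
    # re.escape is a precaution added for this sketch (an assumption, not
    # something known about illumiprocessor itself).
    regex = re.compile(pattern.format(re.escape(prefix)))
    match = regex.match(filename)
    return match.group(1) if match else None


# Hypothetical prefix and file names, invented for this example.
prefix = "RAPiD-Genomics_F176_FUP_141801_P002_WB08"
r1_file = prefix + "_i5-535_i7-57_S327_L001_R1_001.fastq.gz"
r2_file = prefix + "_i5-535_i7-57_S327_L001_R2_001.fastq.gz"

print(read_label(R1_PATTERN, prefix, r1_file))  # label captured from the R1 file
print(read_label(R1_PATTERN, prefix, r2_file))  # R2 file should not match the R1 pattern
```

If the first call returns None for one of your real file names, the number of underscore-separated fields between the prefix and the R1/R2 tag probably does not line up with the number of `(?:.*)` groups in the pattern.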
Thank you very much, Brant! I managed to solve it.
Apparently I had two problems: 1) illumiprocessor was not working inside phyluce for me, so I installed illumiprocessor directly through Anaconda in a Python 2.7 virtual environment; 2) the filenames were really long, and illumiprocessor was probably having difficulty handling them. So I: i) renamed the raw_data files to simpler names; ii) ran illumiprocessor on subsets of the samples (~6 at a time), as suggested in previous posts; iii) used regular expressions similar to the ones you suggested.
Thanks again!
You are welcome, and I'm glad you got it working!
Hello!
This has been discussed previously and I've read those posts, but I'm still getting the following error:
These are my files:
And this is my .config file:
I've already checked the configuration file several times, renamed the sequence files, and changed the --r1-pattern and --r2-pattern settings, but nothing worked.
One thing that caught my attention is that my error is an OSError and not an IOError as in the previous cases. Could this be the reason for my error? If so, could you tell me why this happens?
Thanks!