Closed ConYel closed 3 years ago
When I removed sequences shorter than 18 from the collapsed fasta file and reperformed the map analysis to get the .ARF file mirDeep2.pl run without issues.
Is it possible to provide an error message for this particular reason, because this error message:
[1;31mError: [0mMapping file my_data/collapsed_trimmed_fasta/all_samples_collapsed.arf is not in arf format
it is a bit misleading.
Maybe somewhere with this error message?
Each line of the mapping file must consist of the following fields readID_wo_whitespaces length start end read_sequence genomicID_wo_whitspaces length start end genomic_sequence strand #mismatches editstring The editstring is optional and must not be contained The readID must end with _xNumber and is not allowed to contain whitespaces. The genomeID is not allowed to contain whitespaces.
Thank you very much, again great tool!
Hi, the two errors don't seem to be related.
To debug we would need to find the line which is not parsed properly. May you could run this in a console
perl -ane 'if($_ !~ /^(\S+_x\d+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\S+)\s+(\S+)\s+(\d+)\s+(\d+)\s+(\d+)\s+(\S+)\s+([+-])\s+(\d+)\s([mDIM])$/){print;}' YOUR_ARF_FILE
and post some of the lines here. (Special chars maybe typed in by you again depending on your console).
Cheers
Hello, unfortunately, I'm not able to reproduce it. it always gives:
[1;31mError: [0mproblem with my_data/collapsed_trimmed_fasta/all_samples_collapsed.fasta
Error in line 21: Either the sequence
GGTGTTGTAGGCTT
contains less than 17 characters or contains characters others than [acgtunACGTUN]
Please make sure that your file only comprises sequences that have at least 17 characters
containing letters [acgtunACGTUN]
I don't know exactly where was the issue. with the ARF file as I used the mapper.pl but anyway, I'll keep an eye if by any chance get that particular error again.
Thank you!!
Hello and thank you for this very useful tool! I came across this issue found in the log file after running miRDeep2.pl
I followed the instructions running first the mapper.pl in order to get an ARF file from my collapsed fasta.
mapper.pl my_data/collapsed_trimmed_fasta/all_samples_collapsed.fasta -c -p my_data/all_genome_data/my_species_BOWTIE_index/genome.v1 -o 2 -l 13 -t my_data/collapsed_trimmed_fasta/all_samples_collapsed_mapped.arf -v
A minimal example of the file I got from mapper.pl:
Regarding the whole installation, I'm using a docker container made from the Dockerfile attached.
A similar issue seems to have been opened before: https://github.com/rajewsky-lab/mirdeep2/issues/74#issue-754866169 but there wasn't follow up from that person so I thought to open a new one in case it could be used for future reference.
Thank you for your time and effort.
Dockerfile.txt
I tried to run again the pipeline but only with the "head" of the .arf file:
head my_data/collapsed_trimmed_fasta/all_samples_collapsed_mapped.arf > my_data/collapsed_trimmed_fasta/test.arf
and this time I got a different error:
I will remove sequences smaller than 17 and try again, but the first error message about the .ARF it seems a bit strange.