Closed JoseCorCab closed 4 years ago
@JoseCorCab: This is an issue tracker for bugs in miRDeep2. I don't see a bug reported so I'm going to close this issue.
That said, I suggest to convert the ARF you get from mapper.pl
to BAM using you method of choice, copying each SAM alignment as many times a indicated by the multiplicity in the collapsed read ID and extract your metrics from the resulting BAM file. There should be no need to map the identical reads hundreds or thousands of times to the same reference again and again just to collect some pupulations statistics.
Hello, I am trying to incorporate miRDeep2 to our workflow, parallel to a RNAseq analysis. I performed an execution using default mapper.pl and mirdeep.pl scripts and it works. However, our workflow generate a report file including information about RNAseq samples status and mapping metrics, and we use a BAM file to extact some of these metrics. I wish to extract the same information from mapper.pl output or its temporary files, but I have found that it uses non redundant fasta file (collapsed_reads.fa) as bowtie input and extracted metrics are biased by this, so I need to map all raw redundant reads onto genome generating a SAM or BWT, then convert the output file to BAM and extract all information, and finally generate a "collapsed" ARF for use mirdeep2.pl script. I have also seen that you can use as input a BWA SAM file but it is not clear in documentation if BWA mapping can be performed using all raw reads or collapsed reads:
After that you specified that reads_collapsed.fa and reads_vs_genome.arf must be generated to use mirdeep.pl
So I don't really know if bwa_sam_converter.pl script collapse needs a non-collapsed fasta mapped to genome and it generates a collapsed ARF file, or it needs to collapse fasta file before BWA mapping.
Could you explain me more specifically this performance?
Another tangential doubt is when you specified that:
Which extra fields do bwa_sam_converter.pl need from BWA SAM?
Thanks.