csiro-crop-informatics / biokanga_align_paper

0 stars 0 forks source link

Number of simulated reads not precisely as requested #2

Open rsuchecki opened 6 years ago

rsuchecki commented 6 years ago

Number of reads generated by some of the tools managed by rnftools is not precisely as requested, presumably due to a read being output based on some probability calculation, taking into consideration the size of input genome assembly.

We can either accept this number not being precise or sanitise rnftools output. I would opt for leaving it as is, as the same input is passed to each aligner. In addition, we may decide on reporting results as percentages anyway.

https://github.com/csiro-crop-informatics/biokanga_align_paper/blob/819fdcdcb434cce0e13bd2042c9d8e023558e3cc/main.nf#L147

rsuchecki commented 6 years ago

It might make sense to switch to using coverage in place of number_of_read_tuples.