mbhall88 / rasusa

Randomly subsample sequencing reads or alignments
https://doi.org/10.21105/joss.03941
MIT License
207 stars 17 forks source link

Does rasusa outputs reads that still cover the entire original genome? #56

Closed MostafaYA closed 2 years ago

MostafaYA commented 2 years ago

Hi, When using rasusa for downsampling, do we get reads that still cover the whole genome but with a reduced depth, or is it possible that the reads are only sampled from certain regions (i.e. there is no guarantee that the original genome is well represented in the output)? Thanks

mbhall88 commented 2 years ago

Hi @MostafaYA. Rasusa takes a random subsample of the reads, so there is no guarentee of what parts of the genome will be sampled - as Rasusa knows nothing about whether the reads align to on the genome.

If you want to subsample and ensure coverage across the genome I'd suggest taking a look at this suggestion.