Open wangjiawen2013 opened 4 months ago
The splitting operation is pretty clear to me, but can you elaborate a bit more about what you’d expect from a subsampling command? For example, what would the parameters be, what would the expected input and output be?
You can make it by imitating samtools. Please refer to this post: https://www.biostars.org/p/76791/ https://bioinformatics.stackexchange.com/questions/402/how-can-i-downsample-a-bam-file-while-keeping-both-reads-in-pairs The input is a rad and the output is a smaller rad file subsampled randomly from the origin rad. It's better to set all the parameters the same as samtools for users from samtools to use it seamlessly.
I now have a prototype implementation of split
on the dev
branch. I will probably polish this a bit and cut another release, and then start working on sampling afterward.
Looking forward to it!
Hi @wangjiawen2013,
Great. I just cut the release of 0.2.0 with the split command. I’ll ping back here again when I have an implementation of sampling.
Best, Rob
it will be useful to support subsampling and spliting rad files with radtk.