Closed dbolser closed 3 months ago
Thank you for the inquiry, but no, it's not something bcftools aspires to do. The right tool to use for this is tabix, provided by htslib.
Tabix is great, but it's not parallel.
Thanks for transferring it here. thought bcftools because you already have parallel indexing of BCF files and parsers for BED, but I guess bcf is nothing like bgzip'ed bed.
Is the issue here simply the multi-threaded bgzf decoding? I admit I assumed we'd got that enabled for everything, but indeed tabix doesn't do it apparently. I'm not sure why - either we simply missed it or perhaps it was testing and gave no major benefit (eg the primary CPU burden is elsewhere).
I could imagine it's IO bound... perhaps...
I'd like to index a bed file (tbi) using bcftools (because of the ability to use multiple threads).
Is this a feature that you could easily support?
Many thanks, Dan.