sstadick / perbase

Per-base per-nucleotide depth analysis
MIT License
115 stars, 13 forks

New Tool for Per-Sequence Depths #62

Open oschwengers opened 1 year ago

oschwengers commented 1 year ago

Hi @sstadick, and thanks a lot for developing perbase! I'd love to use perbase to estimate the final coverage of multi-step short/long/hybrid bacterial whole-genome assemblies. For these use cases, a dedicated tool that reports per-sequence summary information would be very helpful.

For example, it would be great to get the following information for each sequence within an assembly:

By "normalized cov" I mean the average coverage of a sequence divided by the average coverage of the longest sequence, as in Unicycler. This gives important information, for example that a sequence "B" has 5x the coverage of sequence "A" (the longest sequence is often the chromosome, or at least a large part of it).
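
To make the definition concrete, here is a minimal Python sketch of the calculation as described above. It is not perbase code and not taken from Unicycler: the function name `per_sequence_coverage`, the input layout (a dict of per-base depths per sequence), and the output field names are all assumptions made for illustration.

```python
def per_sequence_coverage(depths):
    """Summarize per-base depths into per-sequence coverage values.

    depths: dict mapping sequence name -> list of per-base depths
            (hypothetical input layout, e.g. parsed from a depth table).
    Returns a dict mapping sequence name -> length, mean coverage,
    and coverage normalized to the longest sequence.
    """
    if not depths:
        return {}

    def mean_cov(values):
        # An empty sequence is reported with a mean coverage of 0.0.
        return sum(values) / len(values) if values else 0.0

    # The longest sequence serves as the reference for normalization.
    longest = max(depths, key=lambda name: len(depths[name]))
    ref_cov = mean_cov(depths[longest])

    summary = {}
    for name, values in depths.items():
        avg = mean_cov(values)
        # Normalization is undefined if the reference has zero coverage.
        norm = avg / ref_cov if ref_cov > 0 else float("nan")
        summary[name] = {
            "length": len(values),
            "mean_cov": avg,
            "normalized_cov": norm,
        }
    return summary


# Toy data: "A" is the longest sequence at 20x, "B" is shorter at 100x.
example = per_sequence_coverage({"A": [20] * 10, "B": [100] * 4})
print(example["B"]["normalized_cov"])  # prints 5.0
```

With the toy data, sequence "B" comes out at 5x the coverage of the longest sequence "A", which matches the example in the text.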

Again thanks and best regards!

sstadick commented 1 year ago

Hi! I'll keep this open in case I come back to perbase for more development, but I don't currently have the bandwidth to do much work here. PRs are always welcome, though!