NVIDIA-Genomics-Research / GenomeWorks

SDK for GPU accelerated genome assembly and analysis
https://clara-parabricks.github.io/GenomeWorks/
Apache License 2.0
286 stars 76 forks source link

[pygenomeworks] Updates the evaluate_paf script. #557

Closed edawson closed 4 years ago

edawson commented 4 years ago

This PR should fix an issue where passing the same truth and test PAFs resulted in precision and recall values less than 1.0. It also refactors the way searches are done, using a defaultdict[sequence name] -> IntervalTree to find records on the query / target and adds several additional match metrics.