NVIDIA-Genomics-Research / GenomeWorks

SDK for GPU accelerated genome assembly and analysis
https://clara-parabricks.github.io/GenomeWorks/
Apache License 2.0

[pygenomeworks] evaluate_paf does not properly report the number of incorrect starts/ends #560

Closed edawson closed 4 years ago

edawson commented 4 years ago

The evaluate_paf script in pygenomeworks/bin reports the number of correct query/target starts and ends. However, the reported numbers are currently inaccurate, because every searched interval increments these counters rather than only the best match: https://github.com/clara-parabricks/GenomeWorks/blob/88dcc74b17a659e1baf21139920a41d9e0cac7f6/pygenomeworks/bin/evaluate_paf#L195-L198

The proper behavior is to report the correctness of starts/ends only for the best match.
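
A minimal sketch of the intended counting, not the actual evaluate_paf code: the `Interval` type, the `overlap_length` and `count_correct_boundaries` helpers, the `pos_tolerance` parameter, and the choice of "largest overlap" as the definition of best match are all assumptions made for illustration. The idea is to pick the single best candidate first and only then compare starts and ends, so each tested alignment contributes at most one correct start and one correct end.

```python
from collections import namedtuple

# Hypothetical half-open interval type; the real script may store
# alignments differently.
Interval = namedtuple("Interval", ["start", "end"])


def overlap_length(a, b):
    """Number of positions shared by two half-open intervals (0 if disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start))


def count_correct_boundaries(test, candidates, pos_tolerance=0):
    """Return (start_correct, end_correct) as 0/1 flags for one tested interval.

    Only the best match (assumed here to be the candidate with the largest
    overlap) is compared, instead of incrementing for every searched interval.
    """
    best = max(candidates, key=lambda c: overlap_length(test, c), default=None)
    if best is None or overlap_length(test, best) == 0:
        return 0, 0
    start_ok = int(abs(test.start - best.start) <= pos_tolerance)
    end_ok = int(abs(test.end - best.end) <= pos_tolerance)
    return start_ok, end_ok


test = Interval(100, 200)
candidates = [Interval(100, 150), Interval(105, 200), Interval(100, 200)]
starts, ends = count_correct_boundaries(test, candidates)
```

In this example, counting across every searched interval would report two matching starts and two matching ends for a single tested alignment, whereas the best-match version reports one of each.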