Status: Closed (AShaw1802 closed 1 month ago)
Current approach: for ambiguous mappings, where a read maps to multiple references in the database, we currently filter these reads out because there is no clear signal for a specific reference. We propose to change the mapping parsing steps as such:
https://github.com/polio-nanopore/piranha/pull/222
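For anyone following along, the filtering behaviour described above can be sketched roughly as below. This is a minimal illustration, not Piranha's actual parsing code: the function name `split_ambiguous` is hypothetical, and it assumes only the standard PAF layout (query name in column 1, target name in column 6, at least 12 tab-separated columns).

```python
from collections import defaultdict


def split_ambiguous(paf_lines):
    """Separate reads with one reference hit from reads hitting several.

    Hypothetical helper for illustration only; assumes standard PAF columns.
    """
    refs_by_read = defaultdict(set)
    for line in paf_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 12:
            continue  # skip blank or malformed records
        read_name, ref_name = fields[0], fields[5]
        refs_by_read[read_name].add(ref_name)

    # Reads with exactly one distinct target are unambiguous.
    unique = {
        read: next(iter(refs))
        for read, refs in refs_by_read.items()
        if len(refs) == 1
    }
    # Reads with more than one distinct target are the ones currently filtered out.
    ambiguous = {
        read: sorted(refs)
        for read, refs in refs_by_read.items()
        if len(refs) > 1
    }
    return unique, ambiguous
```

Under the current approach, everything in `ambiguous` would be discarded; a more permissive parser would instead need a rule for choosing among the candidate references.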
Dev work on this issue continues: more permissive PAF parsing will now raise the issue of needing to mask regions that do not have good coverage because of mapping failure.
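As a rough illustration of the masking step mentioned above (not the implementation on main), low-coverage positions in a consensus could be replaced with `N`. The function name `mask_low_coverage` and the `min_depth` default are assumptions made for this sketch, as is the idea that per-position depths are already available as a list.

```python
def mask_low_coverage(sequence, depths, min_depth=10):
    """Replace bases whose read depth is below min_depth with 'N'.

    Hypothetical sketch: `depths` holds one depth value per base of `sequence`.
    """
    if len(sequence) != len(depths):
        raise ValueError("sequence and depths must have the same length")
    return "".join(
        base if depth >= min_depth else "N"
        for base, depth in zip(sequence, depths)
    )
```

For example, `mask_low_coverage("ACGT", [20, 5, 0, 10])` masks the two middle bases and leaves the outer two unchanged.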
This is now resolved on main.
I think we may be seeing a case in Pakistan where wt1 reads are mapping to reference sequences in the database that are too similar to each other, so the reads are being assigned as ambiguous mappings (I'm trying to get the raw data now to share). Is there a measure of how similar the reference sequences can be before it becomes an issue? We can screen the current database, but people may add their own sequences in the future. Is there a way that Piranha could cope with similar references?
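One possible way to screen a database for overly similar references, offered only as a sketch: compute pairwise identity over an existing multiple sequence alignment and flag pairs above a cutoff. The function names and the `threshold` value here are hypothetical, and the level of similarity at which Piranha's mapping actually becomes ambiguous is not something this thread establishes.

```python
from itertools import combinations


def pairwise_identity(seq_a, seq_b):
    """Fraction of matching bases over columns with no gap or N in either sequence."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # ignore gapped or ambiguous columns
        compared += 1
        if a == b:
            matches += 1
    return matches / compared if compared else 0.0


def flag_similar_references(aligned_refs, threshold=0.95):
    """Return (name_a, name_b, identity) for reference pairs at or above threshold.

    `aligned_refs` maps reference name to its aligned sequence.
    The threshold is an arbitrary placeholder, not a validated cutoff.
    """
    flagged = []
    for (name_a, seq_a), (name_b, seq_b) in combinations(sorted(aligned_refs.items()), 2):
        identity = pairwise_identity(seq_a, seq_b)
        if identity >= threshold:
            flagged.append((name_a, name_b, identity))
    return flagged
```

A check like this could be run on the current database and again whenever users add their own sequences, though a suitable cutoff would need to be determined empirically from cases like the one described above.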