LooseLab / rftools

Tools for post processing readfish data
MIT License
1 stars 0 forks source link

Splitting duplex #6

Closed Adoni5 closed 4 months ago

Adoni5 commented 4 months ago

Duplex read ids seem to be a combination of their parents read_ids, joined by a semi colon. So we would compare both halves of the read_id in turn to the unblocked readids hashset.

If there is a match from either we would consider that read unblocked.

Adoni5 commented 4 months ago

Addressed in #9

mattloose commented 4 months ago

Just adding a comment - really we shouldn't have any of these. If we have unblocked the first read then the second read should not be related - i.e it should not be a duplex read.

If we had not unblocked the first read we should always accept the second read.

It would be interesting to look at any reads like this that are unblocked as they would be examples of things we didn't want happening....

Adoni5 commented 4 months ago

See https://github.com/LooseLab/readfish/pull/324 for discussion!