Closed jpalmer37 closed 5 years ago
No, something is wrong. I think using the global consensus sequence for hypermut is a mistake; we should be using the consensus of the sequences from the first sample collection date.
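To make the suggestion concrete, here is a rough sketch of building the reference from the earliest time point only. It is not the script used in this repository: the function name `first_timepoint_consensus` and the `(collection_date, aligned_sequence)` input format are assumptions made for illustration, and it takes a simple per-column majority over already-aligned sequences.

```python
from collections import Counter
from datetime import date


def first_timepoint_consensus(records):
    """Majority-rule consensus of the sequences sampled on the earliest date.

    `records` is a hypothetical list of (collection_date, aligned_sequence)
    tuples; all sequences are assumed to come from the same alignment.
    """
    if not records:
        raise ValueError("no sequences supplied")

    # Keep only the sequences from the first sample collection date.
    earliest = min(collection_date for collection_date, _ in records)
    baseline = [seq.upper() for collection_date, seq in records
                if collection_date == earliest]

    length = len(baseline[0])
    if any(len(seq) != length for seq in baseline):
        raise ValueError("sequences must be aligned to the same length")

    # Take the most common character in each alignment column.
    # Ties go to the character seen first in that column.
    consensus = []
    for column in zip(*baseline):
        most_common_char, _ = Counter(column).most_common(1)[0]
        consensus.append(most_common_char)
    return "".join(consensus)


if __name__ == "__main__":
    example = [
        (date(2001, 3, 1), "ACGT"),
        (date(2001, 3, 1), "ACGA"),
        (date(2001, 3, 1), "ACGA"),
        (date(2005, 7, 15), "ACAA"),
    ]
    print(first_timepoint_consensus(example))
```

The resulting string would then be passed to hypermut as the reference in place of the global consensus, so that later sequences are compared against the earliest sampled population rather than against an average over all time points.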
I see. That makes sense. I'll apply that fix. Thanks!
Applied this fix using this new script: hypermut screen. Still need to verify that output is reasonable.
The original data set contained 260 sequences, and hypermut.py filtered out 74 of these, leaving 186.
Removing these sequences clearly weakens the signal and narrows the date range of this data set. Do you think I should trust this result, or investigate this behaviour further?