Open j23414 opened 7 months ago
Context
Flagged by https://github.com/nextstrain/dengue/issues/28#issuecomment-1951297740 as well as prior historical discussions.
Description
Design and implement deduplication paths in the phylogenetic workflow.
Examples
Flagging some duplicates from a Slack message here
Possible solution
Preferably, leverage the tools already available in the Nextstrain Dockerfile; seqkit is a probable choice.
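
To make the idea concrete, here is a minimal Python sketch of one possible deduplication rule: keep the first record for each distinct sequence and report which records were dropped. This is only an illustration, not the workflow's implementation. The issue does not settle whether duplicates should be defined by identical sequence, strain name, or accession; the sketch assumes identical sequence (case-insensitive). The function names `parse_fasta` and `deduplicate` are hypothetical. With seqkit, `seqkit rmdup -s` removes duplicates by sequence in a similar way.

```python
import hashlib
from typing import Dict, Iterable, Iterator, List, Tuple


def parse_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def deduplicate(
    records: Iterable[Tuple[str, str]],
) -> Tuple[List[Tuple[str, str]], Dict[str, List[str]]]:
    """Keep the first record per distinct sequence (assumption: case-insensitive match).

    Returns (kept, duplicates), where duplicates maps each kept header
    to the headers of later records that carried the same sequence.
    """
    first_seen: Dict[str, str] = {}  # sequence digest -> header of first record
    kept: List[Tuple[str, str]] = []
    duplicates: Dict[str, List[str]] = {}
    for header, seq in records:
        digest = hashlib.sha256(seq.upper().encode()).hexdigest()
        if digest in first_seen:
            duplicates.setdefault(first_seen[digest], []).append(header)
        else:
            first_seen[digest] = header
            kept.append((header, seq))
    return kept, duplicates


# Made-up records for illustration only (not real dengue data).
example = """>strain_A
ACGT
ACGT
>strain_B
acgtacgt
>strain_C
ACGTTTTT
""".splitlines()

kept, duplicates = deduplicate(parse_fasta(example))
for header, seq in kept:
    print(f">{header}\n{seq}")
print(duplicates)
```

Reporting the dropped headers alongside the kept ones would make it possible to review which records were collapsed before committing to a rule in the workflow.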