wilkelab / Opfi

A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.
https://opfi.readthedocs.io/
MIT License
21 stars 5 forks source link

Deduplicate supersets #182

Closed jimrybarski closed 3 years ago

jimrybarski commented 3 years ago

When we re-ran gene_finder with an expanded Cas12/13 database, we generated a bunch of entries that aren't removed by the deduplication functions. The new function deduplicates such operons.