wilkelab / Opfi

A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.
https://opfi.readthedocs.io/
MIT License

Make `assemble_operons()` memory efficient #132

Closed (jimrybarski closed this issue 4 years ago)

jimrybarski commented 4 years ago

`analyze.assemble_operons()` currently loads every Operon object into memory before it begins to yield any of them. On our current dataset, memory usage is approaching 270 GB. A simple change would reduce this to a few kB.
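To illustrate the kind of change meant here, below is a minimal sketch contrasting an eager generator with a lazy one. It is not Opfi's actual code: the `Operon` stand-in, the `(contig, feature)` row format, and both function names are hypothetical, and the sketch assumes rows for the same contig arrive next to each other in the input.

```python
from itertools import groupby
from typing import Iterable, Iterator, List, Tuple

# Hypothetical row format for this sketch: (contig accession, feature name).
Row = Tuple[str, str]


class Operon:
    """Hypothetical stand-in for Opfi's Operon class."""

    def __init__(self, contig: str, features: List[str]) -> None:
        self.contig = contig
        self.features = features


def _contig_of(row: Row) -> str:
    return row[0]


def assemble_operons_eager(rows: Iterable[Row]) -> Iterator[Operon]:
    """Builds every Operon before yielding the first one.

    Peak memory grows with the total number of operons in the input.
    """
    operons = [
        Operon(contig, [feature for _, feature in group])
        for contig, group in groupby(rows, key=_contig_of)
    ]
    yield from operons


def assemble_operons_lazy(rows: Iterable[Row]) -> Iterator[Operon]:
    """Yields each Operon as soon as its rows have been read.

    Only the rows of the operon being built are held at any one time,
    provided the caller does not keep references to earlier operons.
    """
    for contig, group in groupby(rows, key=_contig_of):
        yield Operon(contig, [feature for _, feature in group])
```

Callers that iterate over the result once (for example, filtering operons and writing them out) would see the same sequence of objects from either version. Callers that wrap the result in `list(...)` would still hold everything in memory, so the saving depends on how the output is consumed.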