wilkelab / Opfi

A Python package for discovery, annotation, and analysis of gene clusters in genomics or metagenomics data sets.
https://opfi.readthedocs.io/
MIT License

Provide an option to incrementally write Gene Finder output to disk #136

Closed alexismhill3 closed 4 years ago

alexismhill3 commented 4 years ago

Also adds another closely related feature: the pipeline can now process data starting from an arbitrary contig in a file.
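For readers unfamiliar with the idea, here is a rough sketch of what "incremental output plus starting from an arbitrary contig" could look like. It is not Opfi's actual implementation or API: `read_contigs`, `process_incrementally`, and the `start_contig` parameter are hypothetical names made up for illustration, and the per-contig "result" is just the sequence length standing in for real Gene Finder output.

```python
import csv
import io
from typing import Iterator, Optional, TextIO, Tuple


def read_contigs(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (contig_id, sequence) pairs from FASTA text, one contig at a time."""
    contig_id: Optional[str] = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if contig_id is not None:
                yield contig_id, "".join(chunks)
            header = line[1:].split()
            contig_id = header[0] if header else ""
            chunks = []
        elif contig_id is not None:
            chunks.append(line)
    if contig_id is not None:
        yield contig_id, "".join(chunks)


def process_incrementally(
    fasta: TextIO, out: TextIO, start_contig: Optional[str] = None
) -> int:
    """Write one CSV row per contig as soon as that contig has been processed.

    If start_contig is given, earlier contigs are skipped (hypothetical option).
    Returns the number of contigs processed.
    """
    writer = csv.writer(out)
    started = start_contig is None
    processed = 0
    for contig_id, sequence in read_contigs(fasta):
        if not started:
            if contig_id != start_contig:
                continue
            started = True
        # Placeholder result: a real pipeline would write its search hits here.
        writer.writerow([contig_id, len(sequence)])
        out.flush()  # push buffered rows out so a crash loses at most one contig
        processed += 1
    return processed
```

Because each contig's rows are written and flushed before the next contig is read, an interrupted run could in principle be resumed by passing the ID of the first unprocessed contig as `start_contig` and opening the output file in append mode.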

This makes a couple of changes to the Gene Finder API that are not backwards compatible. I really didn't like how some arguments were redundant, but I couldn't think of a way to fix this without leaving behind a bunch of clunky, non-functional arguments. Since we currently have only 3 users, I figure it's not a huge problem. Anyway, here are the changes:

I manually tested this on some real metagenomic data and it seems to be working.

alexismhill3 commented 4 years ago

Okay, this should be ready for review again.