rotary-genomics / spokewrench

Toolkit for manipulating circular DNA sequence elements
BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Replace circlator merge with internal code #10

Open jmtsuji opened 10 months ago

jmtsuji commented 10 months ago

As mentioned in rotary-genomics/rotary-utils#8, the merge module from circlator (a very helpful package!) is currently used to guide the final step in the end repair workflow. circlator merge uses nucmer (from the mummer package) to align a 'guide contig' (spanning the ends of the FastA entry for the circular contig) with the circular reference contig. Then, circlator merge parses the nucmer hit table to find suitable matches of the two contigs. If the match stats pass required thresholds (e.g., minimum % ID and position relative to the ends of both contigs), then the circlator merge module stitches the guide contig into the circular contig at the sites determined by nucmer. This process ultimately allows circlator merge to repair mis-assembled regions around the ends (in the FastA file) of the circular contig.

Unfortunately, circlator merge is no longer actively supported. It is also part of a larger package that includes many dependencies we don't need and uses approaches that aren't up-to-date with some of the rapid changes that have occured in the field of long read sequencing. Because of this, I'd like to drop circlator merge from the end repair code and replace it with an in-house alternative.

Desired features of the in-house alternative:

jmtsuji commented 10 months ago

@LeeBergstrand My apologies, I accidentally made my initial commit for this project to the develop branch at 81bb669! I've now made a separate branch, stitch_module, for further development. The stitch.py code is in a separate file from other code, so this accidental commit should have limited impact on the rest of the repo.

LeeBergstrand commented 4 months ago

@jmtsuji Feel free to close if this has been addressed.

jmtsuji commented 4 months ago

Still ongoing -- thanks for migrating this over to the new repo.