nebiolabs / domainator

A flexible and modular software suite for domain-based gene neighborhood and protein search, extraction, and clustering.

sequence comparison in compare_contigs.py #5

Open seanrjohnson opened 5 months ago

seanrjohnson commented 5 months ago

Right now compare_contigs.py compares contigs based only on domain content. It would be nice to also have some sequence-comparison-based metrics, such as nucleotide or protein k-mer Jaccard or adjacency index, or local or global alignment methods, for example the Domain Sequence Similarity metric (https://www.biorxiv.org/content/10.1101/445270v1).
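
To make the k-mer Jaccard idea concrete, here is a minimal sketch in plain Python. The names `kmer_set` and `kmer_jaccard` and the default `k=3` are hypothetical and not part of domainator; how this would plug into compare_contigs.py is left open.

```python
from typing import Set


def kmer_set(seq: str, k: int = 3) -> Set[str]:
    """Return the set of distinct k-mers in a nucleotide or protein sequence."""
    seq = seq.upper()
    # Sequences shorter than k yield an empty set.
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_jaccard(seq_a: str, seq_b: str, k: int = 3) -> float:
    """Jaccard index of the k-mer sets: |A & B| / |A | B|.

    Hypothetical helper for illustration; returns 0.0 when neither
    sequence contains any k-mer.
    """
    a = kmer_set(seq_a, k)
    b = kmer_set(seq_b, k)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)
```

For contig-level comparison, the same function could be applied to the concatenated sequence of each contig, or per protein and then aggregated; which of those is more useful probably depends on the use case.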