broadinstitute / viral-assemble

viral-ngs: genome assembly and scaffolding
Other
8 stars 2 forks source link

make reference imputation tolerate incomplete draft assembly input #41

Open dpark01 opened 8 months ago

dpark01 commented 8 months ago

Currently, our assembly.impute_from_reference command requires inFasta and inReference to have the same number of sequences. This is because, in order to perform imputation, we pairwise align the whole segment of inFasta segment i to inReference segment i because we don't want to guess which segment is which if we don't receive a complete genome.

But maybe in the case that inFasta has less than the full number of segments compared to inReference, the user is a bit desperate anyway.. maybe we can use skani to pick which inReference segment corresponds to which inFasta segment and proceed as before.