Closed dhoogest closed 1 year ago
@nhoffman recommends changing logic to iterate over specimens and vsearch with global pairwise aln on the shorter of the pair.
Reverse and forward reads are now grouped by specimen before clustering. Also note I also made the --iddef 0 update.
Also note the new counts.csv file. I also put barcodecop as the first step in the pipeline to sort out the index file(s) first to avoid compounding the if/else logic further down in the pipeline with the different index file variations.
@nhoffman @crosenth this approach would seem to address our intended use of the current
vsearch_svs
step, which I think is supposed to facilitate the combination of 'reverse' oriented SVs with the complementary forward 'passed' seqs. Change should address the problem described here, where an inaccurate sv clustering created the appearance of an SV in samples where no association was expected or seen prior to clustering.We should probably spell out the desired logic fully as part of implementing this change.