nhoffman / ya16sdb

A curated subset of 16S rRNA sequences from NCBI
2 stars 3 forks source link

bin/match_hits.py look top hit within same species #47

Closed crosenth closed 2 years ago

crosenth commented 2 years ago

New algo:

  1. Expand vsearch to top 5-10-20 hits
  2. Select all hits at the single highest pct_id
  3. If a hit(s) exists with same species taxonomy id then choose that hit. Otherwise take whatever vsearch returns as top pct_id hit
crosenth commented 2 years ago

0.7.2 release - https://github.com/nhoffman/ya16sdb/commit/ac585957d098ee21b38b6e0716f8e03a7cad3a66

@dhoogest this may affect the NGS16S reports rendering:

https://gitlab.labmed.uw.edu/molmicro/NGS16S/-/blob/master/pipeline/render.py#L343