ohnosequences / mg7

Configurable and scalable 16S metagenomics data analysis
https://goo.gl/y3rZFD
GNU Affero General Public License v3.0
3 stars 3 forks source link

add the average identity percentage to the counts #65

Closed eparejatobes closed 8 years ago

eparejatobes commented 8 years ago

From the BLAST output we have the pident field. We just need to

  1. add this value to the assignment output (this is useful in itself)
  2. calculate the average in the counting step
laughedelic commented 8 years ago

@eparejatobes one thing I'm doubting here:

laughedelic commented 8 years ago

Never mind. I forgot that we keep only hits with the same maximum pident 😓

eparejatobes commented 8 years ago

In general it should be average across all hits; we're keeping the max so no need to calculate it, but that could be a future setting: something like calculate the LCA for all hits close to the maximum pident, particularly when setting the pident threshold to a lower value.

rtobes commented 8 years ago

I agree. And when MG7 evolve for other uses (functional assignment for example) it would be useful. (the parameter is pident :))

laughedelic commented 8 years ago

This is done. With average pident: