Calculation of module abundance when coverages of the module are the same in different ways of calculation

raeslab / omixer-rpm

A Reference Pathways Mapper for turning metagenomic functional profiles into pathway/module profiles.

Hi, thank you so much for developing and maintaining this tool. I have a question regarding the calculation of module abundance. Say there is a module M that includes two steps for a complete process. In the definition style of the GBM database, it might be defined as:

///
M Some useful module
K123 NOG456
K789
///

So the abundance of M can be calculated from either one of the following, depending on whichever calculation has the higher coverage:

Average(K123, K789)
Average(NOG456, K789)

My question is:

What if the coverages of the two calculations are equal, but the resulting average abundances of the module are not the same? Which abundance/calculation will OMIXER-RPM use?
What if K123 and NOG456 are the same gene, just with two different IDs because of the different naming systems of different databases? Which abundance/calculation will OMIXER-RPM use, and wouldn't that cause the gene to be counted twice?
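
To make the first question concrete, here is a minimal sketch in Python of how I understand the calculation. It is not OMIXER-RPM's actual code: the `score` helper, the abundance values, and the choice to average over all steps of a path are my own assumptions for illustration.

```python
# Hypothetical abundances for one sample (made-up numbers).
abundance = {"K123": 10.0, "NOG456": 4.0, "K789": 6.0}

# Module M: step 1 is K123 or NOG456, step 2 is K789,
# which gives two alternative paths through the module.
paths = [("K123", "K789"), ("NOG456", "K789")]


def score(path, abundance):
    """Return (coverage, average abundance) for one path.

    Assumption: coverage is the fraction of steps with non-zero
    abundance, and the average is taken over all steps of the path.
    """
    present = [abundance[k] for k in path if abundance.get(k, 0) > 0]
    coverage = len(present) / len(path)
    average = sum(present) / len(path)
    return coverage, average


results = {path: score(path, abundance) for path in paths}
for path, (coverage, average) in results.items():
    print(path, "coverage:", coverage, "average:", average)
```

With these numbers both paths have a coverage of 1.0, but the averages are 8.0 and 5.0, so coverage alone does not decide which value is reported for M.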

Looking forward to your reply! Thanks!

Best,

Ben

Calculation of module abundance when coverages of the module are the same in different ways of calculation #8