reimandlab / ActiveDriverDB

ActiveDriverDB
GNU Lesser General Public License v2.1
12 stars 3 forks source link

Select the less significant call in case of duplicate HIV calls #179

Closed krassowski closed 2 years ago

krassowski commented 2 years ago

Similar to #178 but for HIV and using a different priortization. As seen on the attached screenshot:

Screenshot from 2022-02-06 21-06-59

some peptides result in identical PTM calls with different statistics provided (row 44 and 45). As we do not know which call is more likely to be correct, we can choose the more conservative one (the one with a higher p-value) to avoid too many false discoveries.

For some analyses a better approach would be to test both separately and compare the results, but it is not feasible to check all combinations in a sensitivity analysis, nor to import all combinations to the database without reworking the schema, so I suggest to take the path of being on the more cautious side and possibly systematically under-estimating the effects.