vdemichev / DiaNN

DIA-NN - a universal automated software suite for DIA proteomics data analysis.

Proteins.Identified in report.stats.tsv #888

Open zhangdong360 opened 6 months ago

zhangdong360 commented 6 months ago

Hey, I found that the number of Proteins.Identified in the report.stats.tsv file does not match the number of proteins in the other matrix tables: it is lower than the number in the pr, pg and unique_genes matrices. It is not clear to me how these two quantities are related. How do I filter report.tsv or another file to get the Proteins.Identified value from report.stats.tsv? I read your description of both in the README, but I still don't quite understand how the number of proteins in the stats report is obtained. Kind regards, Dong

vdemichev commented 6 months ago

Hi Dong,

Different output files are produced using different filtering. Please see the output description in the docs. If in doubt, please always just use the main report only. The number of proteins in the stats report can be reproduced by:

Best, Vadim
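
For readers who want to try this kind of per-run count on the main report themselves, here is a minimal sketch in Python. It is not the snippet from the reply above, and the exact filter behind Proteins.Identified is not confirmed anywhere in this thread. The sketch assumes the count is the number of unique Protein.Group values per Run among rows that pass a q-value cut-off, optionally restricted to proteotypic precursors. The choice of Protein.Q.Value, the 0.01 threshold and the Proteotypic restriction are all assumptions, exposed as parameters so they can be changed. The function names are hypothetical.

```python
import csv
from collections import defaultdict


def count_proteins_per_run(rows, q_col="Protein.Q.Value", threshold=0.01,
                           proteotypic_only=True):
    """Count unique Protein.Group values per Run among rows passing the filter.

    `rows` is any iterable of dicts keyed by main-report column names.
    The default filter is an assumption, not a confirmed reproduction of
    Proteins.Identified in report.stats.tsv.
    """
    groups = defaultdict(set)
    for row in rows:
        # Skip precursors whose q-value exceeds the chosen cut-off.
        if float(row[q_col]) > threshold:
            continue
        # Optionally keep only precursors flagged as proteotypic.
        if proteotypic_only and int(float(row["Proteotypic"])) != 1:
            continue
        groups[row["Run"]].add(row["Protein.Group"])
    # Runs with no passing rows are simply absent from the result.
    return {run: len(protein_groups) for run, protein_groups in groups.items()}


def count_from_report(path, **kwargs):
    """Apply count_proteins_per_run to a tab-separated report file."""
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        return count_proteins_per_run(reader, **kwargs)
```

Calling `count_from_report("report.tsv")` returns a dict mapping each run to its count, which can be compared with the Proteins.Identified column of report.stats.tsv. If the numbers differ, trying another q-value column (for example `q_col="PG.Q.Value"`) or `proteotypic_only=False` shows how strongly the result depends on the filtering.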