ftwkoopmans / msdap

MS-DAP: downstream analysis pipeline for quantitative proteomics
GNU General Public License v3.0
31 stars 7 forks source link

output tables from analysis_quickstart() #40

Closed brookegenovese closed 2 months ago

brookegenovese commented 2 months ago

Hi there, I'm looking for some clarification re: the output tables from running analysis_quickstart()that appear in the designated msdap_results timestamped folder. Apologies if this information is detailed somewhere in the vignettes and I've missed it!

Could you clarify how are the six .tsv files are different with respect to the 1) entries (peptides or proteins) included and 2) the calculated values for each protein within each sample? I sometimes use the output from msdap without running any statistically contrasts and want to ensure I'm using the correct datatable in my downstream bioinformatic analyses.

I'm assuming the as-is.tsv files contain the data as it is imported from whatever software you're using (in my case, Spectronaut) prior to using filter_dataset(), and as such would be prior to normalization/filtering...

Here are the .tsv's of interest, for reference. Thanks!

peptide_abundancefilter by group independently.tsv peptide_abundance__global data filter.tsv peptide_abundanceinput data as-is.tsv

protein_abundancefilter by group independently.tsv protein_abundance__global data filter.tsv protein_abundanceinput data as-is.tsv

ftwkoopmans commented 2 months ago

The protein-level output data are the same as their counterpart peptide-level tables (i.e. same filename), with the only difference that the MaxLFQ algorithm was applied to "rollup" from peptide- to protein-level.

There are 4 possible output table variants, with different filtering/normalization applied to them:

brookegenovese commented 1 month ago

This is what I needed - thanks!