Nesvilab / philosopher

PeptideProphet, PTMProphet, ProteinProphet, iProphet, Abacus, and FDR filtering
https://philosopher.nesvilab.org
GNU General Public License v3.0
110 stars 18 forks source link

cannot decode packed binary. msgpack: invalid code=d3 decoding bytes length #274

Closed apataskar closed 3 years ago

apataskar commented 3 years ago

Hi,

I get this error while running from filter step in the pipeline; (attached my parameter file). Could you help me solve it?

I said no to database search and peptide valdation, as untill this point there was no error!

Thanks a lot

Abhi

INFO[11:15:15] Executing Pipeline v4.0.0 INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on CPTAC_263d3f-I_blcdb9-I_c4155b-C_117C_W_BI_20140417_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0D0_BH-A0HK_C8-A12T_117C_W_BI_20130326_H-JQ INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0D2_C8-A12U_AR-A1AS_117C_W_BI_20131010_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0EV_AN-A0AM_D8-A142_117C_W_BI_20130625_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0SW_AO-A0JL_BH-A0BV_117C_W_BI_20131024_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0T6_E2-A158_E2-A15A_117C_W_BI_20130918_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0T7_C8-A12Q_A8-A079_117C_W_BI_20130820_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0YF_BH-A0DD_BH-A0E9_117C_W_BI_20131018_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0YG_E2-A150_BH-A18N_117C_W_BI_20130912_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A2-A0YM_BH-A0C7_A2-A0SX_117C_W_BI_20131025_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A7-A0CD_C8-A12W_AN-A0AL_117C_W_BI_20130913_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A7-A0CE_BH-A0C0_A2-A0YC_117C_W_BI_20130524_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A7-A0CJ_AO-A12F_A2-A0YL_117C_W_BI_20130805_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A8-A06Z_A2-A0D1_A2-A0CM_117C_W_BI_20130401_H-JQ INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A8-A09G_C8-A131_C8-A134_117C_W_BI_20131011_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_A8-A09I_C8-A12L_A2-A0EX_117C_W_BI_20130302_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:15] Initiating the workspace on TCGA_AN-A0FL_BH-A0DG_AN-A0AS_117C_W_BI_20130726_H-PM INFO[11:15:15] Creating workspace WARN[11:15:15] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A0JC_A8-A08Z_AR-A0TX_117C_W_BI_20130527_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A0JE_A2-A0T2_AN-A0AJ_117C_W_BI_20130802_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A0JM_C8-A12V_A8-A08G_117C_W_BI_20130927_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A12D_AN-A04A_BH-A0AV_117C_W_BI_20130310_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A12D_AN-A04A_BH-A0AV_117C_W_BI_20130416_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A12D_C8-A131_AO-A12B_117C_W_BI_20130208_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AO-A12E_A8-A06N_A2-A0T1_117C_W_BI_20130909_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A0TR_AO-A03O_BH-A18R_117C_W_BI_20130825_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A0TT_AR-A1AQ_AO-A12B_117C_W_BI_20131022_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A0TV_C8-A12Z_AO-A0JJ_117C_W_BI_20130730_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A0TY_AR-A0U4_BH-A0HP_117C_W_BI_20130408_H-JQ INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A1AP_AN-A0FK_AO-A0J6_117C_W_BI_20130517_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_AR-A1AW_AR-A1AV_C8-A135_117C_W_BI_20130628_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_BH-A0EE_AO-A0J9_BH-A0E0_117C_W_BI_20130412_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_BH-A18U_A2-A0YI_A2-A0EQ_117C_W_BI_20130405_H-JQ INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_BH-A18V_A7-A13F_BH-A0E1_117C_W_BI_20130520_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_C8-A12P_BH-A0C1_A2-A0EY_117C_W_BI_20130622_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_C8-A12P_BH-A0C1_A2-A0EY_117C_W_BI_20131202_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_C8-A138_E2-A154_BH-A0BZ_117C_W_BI_20130225_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_D8-A13Y_A8-A076_AO-A126_117C_W_BI_20130617_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_E2-A10A_BH-A18Q_C8-A130_117C_W_BI_20130222_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Initiating the workspace on TCGA_E2-A159_A2-A0T3_A2-A0YD_117C_W_BI_20130823_H-PM INFO[11:15:16] Creating workspace WARN[11:15:16] A meta data folder was found and will not be overwritten. INFO[11:15:16] Annotating the database INFO[11:16:49] Executing filter on CPTAC_263d3f-I_blcdb9-I_c4155b-C_117C_W_BI_20140417_H-PM INFO[11:16:49] Processing peptide identification files INFO[11:17:33] Printing models INFO[11:17:42] 1+ Charge profile decoy=0 target=0 INFO[11:17:42] 2+ Charge profile decoy=5451 target=92397 INFO[11:17:42] 3+ Charge profile decoy=3339 target=150985 INFO[11:17:42] 4+ Charge profile decoy=537 target=66108 INFO[11:17:42] 5+ Charge profile decoy=55 target=14075 INFO[11:17:42] 6+ Charge profile decoy=10 target=2368 INFO[11:17:43] Database search results ions=235690 peptides=146043 psms=335325 INFO[11:17:44] Converged to 1.00 % FDR with 304872 PSMs decoy=3077 threshold=0.5141 total=307949 INFO[11:17:50] Converged to 1.00 % FDR with 122394 Peptides decoy=1234 threshold=0.8858 total=123628 INFO[11:17:53] Converged to 1.00 % FDR with 210097 Ions decoy=2120 threshold=0.7503 total=212217 FATA[11:17:58] Cannot decode packed binary. msgpack: invalid code=d3 decoding bytes length

apataskar commented 3 years ago

analytics: true # reports when a workspace is created for usage statistics slackToken: # specify the Slack API token (how to generate a token: https://api.slack.com/legacy/custom-integrations/legacy-tokens) slackChannel: # specify the channel name, or slackUserID: # specify a user ID for a direct message

Steps: Database Search: no # peptide to spectrum matching with Comet or MSFragger Peptide Validation: no # peptide assignment validation with PeptideProphet PTM Localization: no # PTM site localization with PTMProphet Protein Inference: no # protein identification validation with ProteinProphet Label-Free Quantification: yes # precursor label-free quantification inspired by moFF Isobaric Quantification: yes # isobaric labeling-based relative quantification for TMT and iTRAQ Bio Cluster Quantification: no # protein report based on Uniprot protein clusters FDR Filtering: yes # statistical filtering, validation and false discovery r ates assessment Individual Reports: yes # multi-level reporting for both narrow-searches and open-searches Integrated Reports: no # combined analysis of LC-MS/MS results inspired by Abacus Integrated Isobaric Quantification: yes # integrates channel abundances from multiple isobaric-tagged samples with TMT-Integrator

Database Search: # MSFragger 3.2 & Comet protein_database: /DATA/apataskar/breast/philo/database/uniprot_human_W_F_rem_con.fas # path to the target-decoy protein database decoytag: rev # prefix tag used added to decoy sequences contaminant_tag: false # prefix tag used added to decoy sequences search_engine: msfragger # search engine options include "comet" and "msfragger" comet: # Comet v2019011 noindex: true # skip mzML file indexing param: # comet parameter file (default "comet.params.txt") extension: mzML # format of the spectra file msfragger: # MSFragger v3.0 path: /DATA/apataskar/breast/philo/bin/MSFragger-3.0/MSFragger-3.0.jar # path to MSFragger jar memory: 64 # how much memory in GB to use param: # MSFragger parameter file extension: mzML # spectra format data_type: 0 # 0 for DDA, 1 for DIA, 2 for DIA-narrow-window num_threads: 64 # 0=poll CPU to set num threads; else specify num threads directly (max 64) precursor_mass_lower: -20 # lower bound of the precursor mass window precursor_mass_upper: 20 # upper bound of the precursor mass window precursor_mass_units: 1 # 0=Daltons, 1=ppm precursor_true_tolerance: 20 # true precursor mass tolerance (window is +/- this value) precursor_true_units: 1 # 0=Daltons, 1=ppm fragment_mass_tolerance: 20 # fragment mass tolerance (window is +/- this value) fragment_mass_units: 1 # fragment mass tolerance units (0 for Da, 1 for ppm) calibrate_mass: 2 # 0=Off, 1=On, 2=On and find optimal parameters deneutralloss: 0 # Perform deneutrallossing or not (0=no, 1=yes) deisotope: 1 # activates deisotoping. isotope_error: -1/0/1/2/3 # 0=off, 0/1/2 (standard C13 error) mass_offsets: 0 # allow for additional precursor mass window shifts. Multiplexed with isotope_error. mass_offsets = 0/79.966 can be used as a restricted ‘open’ search that looks for unmodified and phosphorylated peptides (on any residue) precursor_mass_mode: selected # selected or isolated localize_delta_mass: 0 # this allows shifted fragment ions - fragment ions with mass increased by the calculated mass difference, to be included in scoring delta_mass_exclude_ranges: (-1.5,3.5) # exclude mass range for shifted ions searching fragment_ion_series: b,y # ion series used in search ion_series_definitions: # User defined ion series. (Example: b* N -17.026548;b0 N -18.010565) search_enzyme_name: Trypsin # name of enzyme to be written to the pepXML file search_enzyme_cutafter: KR # residues after which the enzyme cuts search_enzyme_butnotafter: P # residues that the enzyme will not cut before num_enzyme_termini: 2 # 2 for enzymatic, 1 for semi-enzymatic, 0 for nonspecific digestion allowed_missed_cleavage: 2 # maximum value is 5 clip_nTerm_M: 1 # specifies the trimming of a protein N-terminal methionine as a variable modification (0 or 1) variable_mod_01: 15.99490 M 3 # variable modification variable_mod_02: 42.01060 [^ 1 # variable modification variable_mod_03: 144.1021 n^ 1 # variable modification variable_mod_04: 144.1021 S 1 # variable modification variable_mod_05: # variable modification variable_mod_06: # variable modification variable_mod_07: # variable modification allow_multiple_variable_mods_on_residue: 0 # static mods are not considered max_variable_mods_per_peptide: 3 # maximum of 5 max_variable_mods_combinations: 5000 # maximum of 65534, limits number of modified peptides generated from sequence output_file_extension: pepXML # file extension of output files output_format: pepXML # file format of output files (pepXML or tsv) write_calibrated_mgf: 0 # Write calibrated MS2 scan to a MGF file (0 for No, 1 for Yes) output_report_topN: 1 # reports top N PSMs per input spectrum output_max_expect: 50 # suppresses reporting of PSM if top hit has expectation greater than this threshold report_alternative_proteins: 0 # 0=no, 1=yes precursor_charge: 1 6 # assume range of potential precursor charge states. Only relevant when override_charge is set to 1 override_charge: 0 # 0=no, 1=yes to override existing precursor charge states with precursor_charge parameter digest_min_length: 7 # minimum length of peptides to be generated during in-silico digestion digest_max_length: 50 # maximum length of peptides to be generated during in-silico digestion digest_mass_range: 500.0 5000.0 # mass range of peptides to be generated during in-silico digestion in Daltons max_fragment_charge: 2 # maximum charge state for theoretical fragments to match (1-4) track_zero_topN: 0 # in addition to topN results, keep track of top results in zero bin zero_bin_accept_expect: 0 # boost top zero bin entry to top if it has expect under 0.01 - set to 0 to disable zero_bin_mult_expect: 1 # disabled if above passes - multiply expect of zero bin for ordering purposes (does not affect reported expect) add_topN_complementary: 0 # inserts complementary ions corresponding to the top N most intense fragments in each experimental spectra minimum_peaks: 15 # required minimum number of peaks in spectrum to search (default 10) use_topN_peaks: 300 # pre-process experimental spectrum to only use top N peaks min_fragments_modelling: 2 # minimum number of matched peaks in PSM for inclusion in statistical modeling min_matched_fragments: 4 # minimum number of matched peaks for PSM to be reported minimum_ratio: 0.01 # filters out all peaks in experimental spectrum less intense than this multiple of the base peak intensity clear_mz_range: 125.0 131.0 # for iTRAQ/TMT type data; will clear out all peaks in the specified m/z range remove_precursor_peak: 0 # remove precursor peaks from tandem mass spectra. 0=not remove; 1=remove the peak with precursor charge; 2=remove the peaks with all charge states. remove_precursor_range: -1.5,1.5 # m/z range in removing precursor peaks. Unit: Da. intensity_transform: 0 # transform peaks intensities with sqrt root. 0=not transform; 1=transform using sqrt root. mass_diff_to_variable_mod: 0 # Put mass diff as a variable modification. 0 for no; 1 for yes and change the original mass diff and the calculated mass accordingly; 2 for yes but do not change the original mass diff. labile_search_mode: off # type of search (nglycan, labile, or off). Off means non-labile/typical search. restrict_deltamass_to: all # Specify amino acids on which delta masses (mass offsets or search modifications) can occur. Allowed values are single letter codes (e.g. ACD), must diagnostic_intensity_filter: 0 # [nglycan/labile search_mode only]. Minimum relative intensity for SUM of all detected oxonium ions to achieve for spectrum to contain diagnostic fragment evidence. Calculated relative to spectrum base peak. 0 <= value. Y_type_masses: 0 # [nglycan/labile search_mode only]. Specify fragments of labile mods that are commonly retained on intact peptides (e.g. Y ions for glycans). Only used if 'Y' is included in fragment_ion_series. diagnostic_fragments: 1 # [nglycan/labile search_mode only]. Specify diagnostic fragments of labile mods that appear in the low m/z region. Only used if diagnostic_intensity_filter > 0. add_Cterm_peptide: 0.000000 # c-term peptide fixed modifications add_Cterm_protein: 0.000000 # c-term protein fixed modifications add_Nterm_peptide: 0.000000 # n-term peptide fixed modifications add_Nterm_protein: 0.000000 # n-term protein fixed modifications add_A_alanine: 0.000000 # alanine fixed modifications add_C_cysteine: 57.021464 # cysteine fixed modifications add_D_aspartic_acid: 0.000000 # aspartic acid fixed modifications add_E_glutamic_acid: 0.000000 # glutamic acid fixed modifications add_F_phenylalanine: 0.000000 # phenylalanine fixed modifications add_G_glycine: 0.000000 # glycine fixed modifications add_H_histidine: 0.000000 # histidine fixed modifications add_I_isoleucine: 0.000000 # isoleucine fixed modifications add_K_lysine: 144.1021 # lysine fixed modifications add_L_leucine: 0.000000 # leucine fixed modifications add_M_methionine: 0.000000 # methionine fixed modifications add_N_asparagine: 0.000000 # asparagine fixed modifications add_P_proline: 0.000000 # proline fixed modifications add_Q_glutamine: 0.000000 # glutamine fixed modifications add_R_arginine: 0.000000 # arginine fixed modifications add_S_serine: 0.000000 # serine fixed modifications add_T_threonine: 0.000000 # threonine fixed modifications add_V_valine: 0.000000 # valine fixed modifications add_W_tryptophan: 0.000000 # tryptophan fixed modifications add_Y_tyrosine: 0.000000 # tyrosine fixed modifications

Peptide Validation: # PeptideProphet v5.2 concurrent: false # Concurrent execution of multiple instaces extension: pepXML # pepXML file extension clevel: 0 # set Conservative Level in neg_stdev from the neg_mean, low numbers are less conservative, high numbers are more conservative accmass: true # use Accurate Mass model binning decoyprobs: true # compute possible non-zero probabilities for Decoy entries on the last iteration enzyme: trypsin # enzyme used in sample (optional) exclude: false # exclude deltaCn, Mascot, and Comet results from results (default Penalize results) expectscore: true # use expectation value as the only contributor to the f-value for modeling forcedistr: false # bypass quality control checks, report model despite bad modeling glyc: false # enable peptide Glyco motif model icat: false # apply ICAT model (default Autodetect ICAT) instrwarn: false # warn and continue if combined data was generated by different instrument models leave: false # leave alone deltaCn, Mascot, and Comet results from results (default Penalize results) maldi: false # enable MALDI mode masswidth: 5 # model mass width (default 5) minpeplen: 7 # minimum peptide length not rejected (default 7) minpintt: 2 # minimum number of NTT in a peptide used for positive pI model (default 2) minpiprob: 0.9 # minimum probability after first pass of a peptide used for positive pI model (default 0.9) minprob: 0.05 # report results with minimum probability (default 0.05) minrtntt: 2 # minimum number of NTT in a peptide used for positive RT model (default 2) minrtprob: 0.9 # minimum probability after first pass of a peptide used for positive RT model (default 0.9) neggamma: false # use Gamma distribution to model the negative hits noicat: false # do no apply ICAT model (default Autodetect ICAT) nomass: false # disable mass model nonmc: false # disable NMC missed cleavage model nonparam: true # use semi-parametric modeling, must be used in conjunction with --decoy option nontt: false # disable NTT enzymatic termini model optimizefval: false # (SpectraST only) optimize f-value function f(dot,delta) using PCA phospho: false # enable peptide Phospho motif model pi: false # enable peptide pI model ppm: true # use PPM mass error instead of Dalton for mass modeling zero: false # report results with minimum probability 0

PTM Localization: # PTMProphet v6.0 autodirect: false # use direct evidence when the lability is high, use in combination with LABILITY cions: # use specified C-term ions, separate multiple ions by commas (default: y for CID, z for ETD) direct: false # use only direct evidence for evaluating PTM site probabilities em: 2 # set EM models to 0 (no EM), 1 (Intensity EM Model Applied) or 2 (Intensity and Matched Peaks EM Models Applied) static: false # use static fragppmtol for all PSMs instead of dynamically estimates offsets and tolerances fragppmtol: 15 # when computing PSM-specific mass_offset and mass_tolerance, use specified default +/- MS2 mz tolerance on fragment ions ifrags: false # use internal fragments for localization keepold: false # retain old PTMProphet results in the pepXML file lability: false # compute Lability of PTMs massdiffmode: false # use the Mass Difference and localize excludemassdiffmin: 0 # Minimum mass difference excluded for MASSDIFFFMODE analysis (default=0) excludemassdiffmax: 0 # Maximun mass difference excluded for MASSDIFFFMODE analysis (default=0) massoffset: 0 # adjust the massdiff by offset (0 = use default) maxfragz: 0 # limit maximum fragment charge (default: 0=precursor charge, negative values subtract from precursor charge) maxthreads: 4 # use specified number of threads for processing mino: 0 # use specified number of pseudo-counts when computing Oscore (0 = use default) minprob: 0 # use specified minimum probability to evaluate peptides mods: # specify modifications nions: # use specified N-term ions, separate multiple ions by commas (default: a,b for CID, c for ETD) nominofactor: false # disable MINO factor correction when MINO= is set greater than 0 (default: apply MINO factor correction) ppmtol: 1 # use specified +/- MS1 ppm tolerance on peptides which may have a slight offset depending on search parameters verbose: false # produce Warnings to help troubleshoot potential PTM shuffling or mass difference issues

Protein Inference: # ProteinProphet v5.2 accuracy: false # equivalent to --minprob 0 allpeps: false # consider all possible peptides in the database in the confidence model confem: false # use the EM to compute probability given the confidence delude: false # do NOT use peptide degeneracy information when assessing proteins excludezeros: false # exclude zero prob entries fpkm: false # model protein FPKM values glyc: false # highlight peptide N-glycosylation motif icat: false # highlight peptide cysteines instances: false # use Expected Number of Ion Instances to adjust the peptide probabilities prior to NSP adjustment iprophet: false # input is from iProphet logprobs: false # use the log of the probabilities in the Confidence calculations maxppmdiff: 20 # maximum peptide mass difference in PPM (default 20) minprob: 0.05 # peptideProphet probabilty threshold (default 0.05) mufactor: 1 # fudge factor to scale MU calculation (default 1) nogroupwts: false # check peptide's Protein weight against the threshold (default: check peptide's Protein Group weight against threshold) nonsp: false # do not use NSP model nooccam: false # non-conservative maximum protein list noprotlen: false # do not report protein length normprotlen: false # normalize NSP using Protein Length protmw: false # get protein mol weights softoccam: false # peptide weights are apportioned equally among proteins within each Protein Group (less conservative protein count estimate) unmapped: false # report results for UNMAPPED proteins

Label-Free Quantification: # Freequant peakTimeWindow: 0.4 # specify the time windows for the peak (minute) (default 0.4) retentionTimeWindow: 3 # specify the retention time window for xic (minute) (default 3) tolerance: 10 # m/z tolerance in ppm (default 10) raw: false # read raw files instead of converted mzML, or mzXML faims: false # use FAIMS information for the quantification

Isobaric Quantification: # Labelquant bestPSM: true # select the best PSMs for protein quantification level: 2 # ms level for the quantification minProb: 0.7 # only use PSMs with a minimum probability score plex: 4 # number of channels purity: 0.5 # ion purity threshold (default 0.5) removeLow: 0.05 # ignore the lower 3% PSMs based on their summed abundances tolerance: 20 # m/z tolerance in ppm (default 20) uniqueOnly: false # report quantification based on only unique peptides brand: itraq # isobaric labeling brand (tmt, itraq) raw: false # read raw files instead of converted mzML, or mzXML

Bio Cluster Quantification: # BioQuant organismUniProtID: # UniProt proteome ID level: 0.9 # cluster identity level (default 0.9)

FDR Filtering: # Filter psmFDR: 0.01 # psm FDR level (default 0.01) peptideFDR: 0.01 # peptide FDR level (default 0.01) ionFDR: 0.01 # peptide ion FDR level (default 0.01) proteinFDR: 0.01 # protein FDR level (default 0.01) peptideProbability: 0.7 # top peptide probability threshold for the FDR filtering (default 0.7) proteinProbability: 0.5 # protein probability threshold for the FDR filtering (not used with the razor algorithm) (default 0.5) peptideWeight: 1 # threshold for defining peptide uniqueness (default 1) razor: true # use razor peptides for protein FDR scoring picked: true # apply the picked FDR algorithm before the protein scoring mapMods: true # map modifications acquired by an open search models: true # print model distribution sequential: true # alternative algorithm that estimates FDR using both filtered PSM and Protein lists

Individual Reports: # Report msstats: false # create an output compatible to MSstats withDecoys: false # add decoy observations to reports mzID: false # create a mzID output

Integrated Reports: # Abacus protein: true # global level protein report peptide: true # global level peptide report proteinProbability: 0.9 # minimum protein probability (default 0.9) peptideProbability: 0.5 # minimum peptide probability (default 0.5) uniqueOnly: yes # report TMT quantification based on only unique peptides reprint: false # create abacus reports using the Reprint format

Integrated Isobaric Quantification: # TMT-Integrator v1.1.10 path: /DATA/apataskar/breast/philo/bin/TMTIntegrator-v3.0.0.jar # path to TMT-Integrator jar memory: 64 # memory allocation, in Gb output: # the location of output files channel_num: 4 # number of channels in the multiplex (e.g. 10, 11) ref_tag: 117 # unique tag for identifying the reference channel (Bridge sample added to each multiplex) groupby: -1 # level of data summarization(0: PSM aggregation to the gene level; 1: protein; 2: peptide sequence; 3: PTM site; -1: generate reports at all levels) psm_norm: false # perform additional retention time-based normalization at the PSM level outlier_removal: true # perform outlier removal prot_norm: -1 # normalization (0: None; 1: MD (median centering); 2: GN (median centering + variance scaling); -1: generate reports with all normalization options) min_pep_prob: 0.9 # minimum PSM probability threshold (in addition to FDR-based filtering by Philosopher) min_purity: 0.5 # ion purity score threshold min_percent: 0.05 # remove low intensity PSMs (e.g. value of 0.05 indicates removal of PSMs with the summed TMT reporter ions intensity in the lowest 5% of all PSMs) unique_pep: false # allow PSMs with unique peptides only (if true) or unique plus razor peptides (if false), as classified by Philosopher and defined in PSM.tsv files unique_gene: 0 # additional, gene-level uniqueness filter (0: allow all PSMs; 1: remove PSMs mapping to more than one GENE with evidence of expression in the dataset; 2:remove all PSMs mapping to more than one GENE in the fasta file) best_psm: true # keep the best PSM only (highest summed TMT intensity) among all redundant PSMs within the same LC-MS run prot_exclude: none # exclude proteins with specified tags at the beginning of the accession number (e.g. none: no exclusion; sp|,tr| : exclude protein with sp| or tr|) allow_overlabel: true # allow PSMs with TMT on S (when overlabeling on S was allowed in the database search) allow_unlabeled: true # allow PSMs without TMT tag or acetylation on the peptide n-terminus mod_tag: none # PTM info for generation of PTM-specific reports (none: for Global data; S[167],T[181],Y[243]: for Phospho; K[170]: for K-Acetyl) min_site_prob: -1 # site localization confidence threshold (-1: for Global; 0: as determined by the search engine; above 0 (e.g. 0.75): PTMProphet probability, to be used with phosphorylation only) ms1_int: true # use MS1 precursor ion intensity (if true) or MS2 summed TMT reporter ion intensity (if false) as part of the reference sample abundance estimation top3_pep: true # use top 3 most intense peptide ions as part of the reference sample abundance estimation print_RefInt: false # print individual reference sample abundance estimates for each multiplex in the final reports (in addition to the combined reference sample abundance estimate) add_Ref: -1 # add an artificial reference channel if there is no reference channel (-1: don't add the reference; 0: use summation as the reference; 1: use average as the reference; 2: use median as the reference) max_pep_prob_thres: 0 # the threshold for maximum peptide probability min_ntt: 0 # minimum allowed number of enzymatic termini

prvst commented 3 years ago

Do you have database search results, and PeptideProphet files already?

apataskar commented 3 years ago

Its solved! Thanks for reply ☺

From: Felipe da Veiga Leprevost @.> Sent: maandag 27 september 2021 15:43 To: Nesvilab/philosopher @.> Cc: Abhi Pataskar @.>; Author @.> Subject: Re: [Nesvilab/philosopher] cannot decode packed binary. msgpack: invalid code=d3 decoding bytes length (#274)

Do you have database search results, and PeptideProphet files already?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/Nesvilab/philosopher/issues/274#issuecomment-927887570, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AKIRNFO6Z4DHGBBISHRGFDDUEBYERANCNFSM5DWX63VA. Triage notifications on the go with GitHub Mobile for iOShttps://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Androidhttps://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

stsour commented 2 years ago

Hi! I am running into the same issue here. I am runnning philosopher with Comet, PeptideProphet, ProteinProphet, which all proceed without issue. When it gets up to FDR filtering, I get this same error.

Here is the log for the filtering step:

INFO[21:10:07] Executing filter on S1
INFO[21:10:07] Processing peptide identification files
INFO[21:10:10] Printing models
INFO[21:10:13] 1+ Charge profile decoy=0 target=0 INFO[21:10:13] 2+ Charge profile decoy=202 target=3807 INFO[21:10:13] 3+ Charge profile decoy=141 target=5586 INFO[21:10:13] 4+ Charge profile decoy=56 target=1858 INFO[21:10:13] 5+ Charge profile decoy=27 target=471 INFO[21:10:13] 6+ Charge profile decoy=5 target=177 INFO[21:10:13] Database search results ions=9328 peptides=8426 psms=12330 INFO[21:10:13] Converged to 1.00 % FDR with 10838 PSMs decoy=109 threshold=0.5245 total=10947 INFO[21:10:14] Converged to 1.00 % FDR with 6960 Peptides decoy=70 threshold=0.6801 total=7030 INFO[21:10:14] Converged to 1.00 % FDR with 7840 Ions decoy=79 threshold=0.6482 total=7919 FATA[21:10:14] Cannot decode packed binary. msgpack: invalid code=d3 decoding bytes length

When i try to skip the filtering step and just do label-free quant, I get a similar error: INFO[21:12:19] Executing Pipeline v4.1.1
INFO[21:12:19] Creating workspace
WARN[21:12:19] A meta data folder was found and will not be overwritten.
INFO[21:12:19] Initiating the workspace on S1
INFO[21:12:19] Creating workspace
WARN[21:12:19] A meta data folder was found and will not be overwritten.
INFO[21:12:19] Annotating the database
INFO[21:12:33] Executing label-free quantification on S1
FATA[21:12:34] Cannot unmarshal file:msgpack: invalid code=cb decoding bytes length

I already have search results, interact.pep.xml and interact.prot.xml files.

This is my parameter file (search, peptide and protein validation set to no since these steps processed already)

Steps: Database Search: no # peptide to spectrum matching with Comet or MSFragger Peptide Validation: no # peptide assignment validation with PeptideProphe PTM Localization: no # PTM site localization with PTMProphet Protein Inference: no # protein identification validation with ProteinProphet Label-Free Quantification: yes # precursor label-free quantification inspired by moFF Isobaric Quantification: yes # isobaric labeling-based relative quantification for TMT and iTRAQ Bio Cluster Quantification: no # protein report based on Uniprot protein clusters FDR Filtering: yes # statistical filtering, validation and false discovery rates assessment Individual Reports: no # multi-level reporting for both narrow-searches and open-searches Integrated Reports: no # combined analysis of LC-MS/MS results inspired by Abacus Integrated Isobaric Quantification: no # integrates channel abundances from multiple isobaric-tagged samples with TMT-Integrator

Database Search: # MSFragger-3.4 & Comet v2019011 protein_database: /scratch/tsour.s/S1_philosopher/database/2022-02-05-decoys-contam-S1_MTP_fasta.fasta.fas # path to the target-decoy protein database decoytag: rev # prefix tag used added to decoy sequences contaminant_tag: false # prefix tag used added to decoy sequences search_engine: comet # search engine options include "comet" and "msfragger" comet: # Comet v2019011 noindex: true # skip mzML file indexing param: comet.params.txt # comet parameter file (default "comet.params.txt") extension: mzML # format of the spectra file msfragger: # MSFragger v3.3 path: /scratch/tsour.s/S1_philosopher/bin/MSFragger-3.4.jar # path to MSFragger jar memory: 100 # how much memory in GB to use param: params/fragger.params # MSFragger parameter file extension: mzML # spectra format num_threads: 28 # number of CPU threads to use. 0=poll CPU to set num threads precursor_mass_lower: -20 # lower bound of the precursor mass window precursor_mass_upper: 20 # upper bound of the precursor mass window precursor_mass_units: 1 # 0=Daltons, 1=ppm data_type: 0 # 0 for DDA, 1 for DIA, 2 for DIA-narrow-window precursor_true_tolerance: 20 # true precursor mass tolerance (window is +/- this value) precursor_true_units: 1 # 0=Daltons, 1=ppm fragment_mass_tolerance: 20 # fragment mass tolerance (window is +/- this value) fragment_mass_units: 1 # fragment mass tolerance units (0 for Da, 1 for ppm) calibrate_mass: 2 # 0=Off, 1=On, 2=On and find optimal parameters use_all_mods_in_first_search: 0 # use all variable modifications in first search (0 for No, 1 for Yes). write_calibrated_mgf: 0 # write calibrated MS2 scan to a MGF file (0 for No, 1 for Yes) isotope_error: 0/1/2 # 0=off, 0/1/2 (standard C13 error) mass_offsets: 0 # allow for additional precursor mass window shifts. Multiplexed with isotope_error. mass_offsets = 0/79.966 can be used as a restricted ‘open’ search that looks for unmodified and phosphorylated peptides (on any residue) restrict_deltamass_to: all # specify amino acids on which delta masses (mass offsets or search modifications) can occur. Allowed values are single letter codes (e.g. ACD), must precursor_mass_mode: selected # one of isolated/selected/corrected. localize_delta_mass: 0 # this allows shifted fragment ions - fragment ions with mass increased by the calculated mass difference, to be included in scoring delta_mass_exclude_ranges: (-1.5,3.5) # exclude mass range for shifted ions searching fragment_ion_series: b,y # ion series used in search ion_series_definitions: # user defined ion series. (Example: b* N -17.026548;b0 N -18.010565) search_enzyme_name_1: Trypsin # Name of the first enzyme. search_enzyme_cut_1: KR # First enzyme's cutting amino acid. search_enzyme_nocut_1: P # First enzyme's protecting amino acid. allowed_missed_cleavage_1: 2 # First enzyme's allowed number of missed cleavages per peptide. Maximum value is 5. search_enzyme_sense_1: C # First enzyme's cutting terminal. search_enzyme_name_2: # Name of the second enzyme. search_enzyme_cut_2: # Second enzyme's cutting amino acid. search_enzyme_nocut_2: # Second enzyme's protecting amino acid. allowed_missed_cleavage_2: # Second enzyme's allowed number of missed cleavages per peptide. Maximum value is 5. search_enzyme_sense_2: C # Second enzyme's cutting terminal. num_enzyme_termini: 2 # 2 for enzymatic, 1 for semi-enzymatic, 0 for nonspecific digestion clip_nTerm_M: 1 # specifies the trimming of a protein N-terminal methionine as a variable modification (0 or 1) variable_mod_01: 15.99490 M 3 # variable modification variable_mod_02: 42.01060 [^ 1 # variable modification variable_mod_03: 229.162932 n^ 1 # variable modification variable_mod_04: 229.162932 S 1 # variable modification variable_mod_05: # variable modification variable_mod_06: # variable modification variable_mod_07: # variable modification allow_multiple_variable_mods_on_residue: 0 # static mods are not considered max_variable_mods_per_peptide: 3 # maximum of 5 max_variable_mods_combinations: 5000 # maximum of 65534, limits number of modified peptides generated from sequence mass_diff_to_variable_mod: 0 # put mass diff as a variable modification. 0 for no; 1 for yes and change the original mass diff and the calculated mass accordingly; 2 for yes but do not change the original mass diff. output_file_extension: pepXML # file extension of output files output_format: pepXML # file format of output files (pepXML or tsv) output_report_topN: 1 # reports top N PSMs per input spectrum output_max_expect: 50 # suppresses reporting of PSM if top hit has expectation greater than this threshold report_alternative_proteins: 0 # 0=no, 1=yes precursor_charge: 1 6 # assume range of potential precursor charge states. Only relevant when override_charge is set to 1 override_charge: 0 # 0=no, 1=yes to override existing precursor charge states with precursor_charge parameter digest_min_length: 7 # minimum length of peptides to be generated during in-silico digestion digest_max_length: 50 # maximum length of peptides to be generated during in-silico digestion digest_mass_range: 500.0 5000.0 # mass range of peptides to be generated during in-silico digestion in Daltons max_fragment_charge: 2 # maximum charge state for theoretical fragments to match (1-4) track_zero_topN: 0 # in addition to topN results, keep track of top results in zero bin zero_bin_accept_expect: 0 # boost top zero bin entry to top if it has expect under 0.01 - set to 0 to disable zero_bin_mult_expect: 1 # disabled if above passes - multiply expect of zero bin for ordering purposes (does not affect reported expect) add_topN_complementary: 0 # inserts complementary ions corresponding to the top N most intense fragments in each experimental spectra check_spectral_files: 1 # check the spectral files before searching. minimum_peaks: 15 # required minimum number of peaks in spectrum to search (default 10) use_topN_peaks: 300 # pre-process experimental spectrum to only use top N peaks deisotope: 1 # activates deisotoping. deneutralloss: 1 # performs deneutrallossing or not (0=no, 1=yes) min_fragments_modelling: 2 # minimum number of matched peaks in PSM for inclusion in statistical modeling min_matched_fragments: 4 # minimum number of matched peaks for PSM to be reported minimum_ratio: 0.01 # filters out all peaks in experimental spectrum less intense than this multiple of the base peak intensity clear_mz_range: 125.5 131.5 # for iTRAQ/TMT type data; will clear out all peaks in the specified m/z range remove_precursor_peak: 0 # remove precursor peaks from tandem mass spectra. 0=not remove; 1=remove the peak with precursor charge; 2=remove the peaks with all charge states. remove_precursor_range: -1.5,1.5 # m/z range in removing precursor peaks. Unit: Da. intensity_transform: 0 # transform peaks intensities with sqrt root. 0=not transform; 1=transform using sqrt root. labile_search_mode: off # type of search (nglycan, labile, or off). Off means non-labile/typical search. diagnostic_intensity_filter: 0 # [nglycan/labile search_mode only]. Minimum relative intensity for SUM of all detected oxonium ions to achieve for spectrum to contain diagnostic fragment evidence. Calculated relative to spectrum base peak. 0 <= value. Y_type_masses: # [nglycan/labile search_mode only]. Specify fragments of labile mods that are commonly retained on intact peptides (e.g. Y ions for glycans). Only used if 'Y' is included in fragment_ion_series. diagnostic_fragments: # [nglycan/labile search_mode only]. Specify diagnostic fragments of labile mods that appear in the low m/z region. Only used if diagnostic_intensity_filter > 0. add_Cterm_peptide: 0.000000 # c-term peptide fixed modifications add_Cterm_protein: 0.000000 # c-term protein fixed modifications add_Nterm_peptide: 0.000000 # n-term peptide fixed modifications add_Nterm_protein: 0.000000 # n-term protein fixed modifications add_A_alanine: 0.000000 # alanine fixed modifications add_C_cysteine: 57.021464 # cysteine fixed modifications add_D_aspartic_acid: 0.000000 # aspartic acid fixed modifications add_E_glutamic_acid: 0.000000 # glutamic acid fixed modifications add_F_phenylalanine: 0.000000 # phenylalanine fixed modifications add_G_glycine: 0.000000 # glycine fixed modifications add_H_histidine: 0.000000 # histidine fixed modifications add_I_isoleucine: 0.000000 # isoleucine fixed modifications add_K_lysine: 229.162932 # lysine fixed modifications add_L_leucine: 0.000000 # leucine fixed modifications add_M_methionine: 0.000000 # methionine fixed modifications add_N_asparagine: 0.000000 # asparagine fixed modifications add_P_proline: 0.000000 # proline fixed modifications add_Q_glutamine: 0.000000 # glutamine fixed modifications add_R_arginine: 0.000000 # arginine fixed modifications add_S_serine: 0.000000 # serine fixed modifications add_T_threonine: 0.000000 # threonine fixed modifications add_V_valine: 0.000000 # valine fixed modifications add_W_tryptophan: 0.000000 # tryptophan fixed modifications add_Y_tyrosine: 0.000000 # tyrosine fixed modifications

Peptide Validation: # PeptideProphet v5.2 concurrent: false # Concurrent execution of multiple instaces extension: pep.xml # pepXML file extension clevel: 0 # set Conservative Level in neg_stdev from the neg_mean, low numbers are less conservative, high numbers are more conservative accmass: true # use Accurate Mass model binning decoyprobs: true # compute possible non-zero probabilities for Decoy entries on the last iteration enzyme: trypsin # enzyme used in sample (optional) exclude: false # exclude deltaCn, Mascot, and Comet results from results (default Penalize results) expectscore: true # use expectation value as the only contributor to the f-value for modeling forcedistr: false # bypass quality control checks, report model despite bad modeling glyc: false # enable peptide Glyco motif model icat: false # apply ICAT model (default Autodetect ICAT) instrwarn: false # warn and continue if combined data was generated by different instrument models leave: false # leave alone deltaCn, Mascot, and Comet results from results (default Penalize results) maldi: false # enable MALDI mode masswidth: 5 # model mass width (default 5) minpeplen: 7 # minimum peptide length not rejected (default 7) minpintt: 2 # minimum number of NTT in a peptide used for positive pI model (default 2) minpiprob: 0.9 # minimum probability after first pass of a peptide used for positive pI model (default 0.9) minprob: 0.05 # report results with minimum probability (default 0.05) minrtntt: 2 # minimum number of NTT in a peptide used for positive RT model (default 2) minrtprob: 0.9 # minimum probability after first pass of a peptide used for positive RT model (default 0.9) neggamma: false # use Gamma distribution to model the negative hits noicat: false # do no apply ICAT model (default Autodetect ICAT) nomass: false # disable mass model nonmc: false # disable NMC missed cleavage model nonparam: true # use semi-parametric modeling, must be used in conjunction with --decoy option nontt: false # disable NTT enzymatic termini model optimizefval: false # (SpectraST only) optimize f-value function f(dot,delta) using PCA phospho: false # enable peptide Phospho motif model pi: false # enable peptide pI model ppm: true # use PPM mass error instead of Dalton for mass modeling zero: false # report results with minimum probability 0

PTM Localization: # PTMProphet v6.0 autodirect: false # use direct evidence when the lability is high, use in combination with LABILITY cions: # use specified C-term ions, separate multiple ions by commas (default: y for CID, z for ETD) direct: false # use only direct evidence for evaluating PTM site probabilities em: 2 # set EM models to 0 (no EM), 1 (Intensity EM Model Applied) or 2 (Intensity and Matched Peaks EM Models Applied) static: false # use static fragppmtol for all PSMs instead of dynamically estimates offsets and tolerances fragppmtol: 15 # when computing PSM-specific mass_offset and mass_tolerance, use specified default +/- MS2 mz tolerance on fragment ions ifrags: false # use internal fragments for localization keepold: false # retain old PTMProphet results in the pepXML file lability: false # compute Lability of PTMs massdiffmode: false # use the Mass Difference and localize excludemassdiffmin: 0 # Minimum mass difference excluded for MASSDIFFFMODE analysis (default=0) excludemassdiffmax: 0 # Maximun mass difference excluded for MASSDIFFFMODE analysis (default=0) massoffset: 0 # adjust the massdiff by offset (0 = use default) maxfragz: 0 # limit maximum fragment charge (default: 0=precursor charge, negative values subtract from precursor charge) maxthreads: 4 # use specified number of threads for processing mino: 0 # use specified number of pseudo-counts when computing Oscore (0 = use default) minprob: 0 # use specified minimum probability to evaluate peptides mods: # specify modifications nions: # use specified N-term ions, separate multiple ions by commas (default: a,b for CID, c for ETD) nominofactor: false # disable MINO factor correction when MINO= is set greater than 0 (default: apply MINO factor correction) ppmtol: 20 # use specified +/- MS1 ppm tolerance on peptides which may have a slight offset depending on search parameters verbose: false # produce Warnings to help troubleshoot potential PTM shuffling or mass difference issues

Protein Inference: # ProteinProphet v5.2 accuracy: false # equivalent to --minprob 0 allpeps: false # consider all possible peptides in the database in the confidence model confem: false # use the EM to compute probability given the confidence delude: false # do NOT use peptide degeneracy information when assessing proteins excludezeros: false # exclude zero prob entries fpkm: false # model protein FPKM values glyc: false # highlight peptide N-glycosylation motif icat: false # highlight peptide cysteines instances: false # use Expected Number of Ion Instances to adjust the peptide probabilities prior to NSP adjustment iprophet: false # input is from iProphet logprobs: false # use the log of the probabilities in the Confidence calculations maxppmdiff: 20 # maximum peptide mass difference in PPM (default 20) minprob: 0 # peptideProphet probabilty threshold (default 0.05) mufactor: 1 # fudge factor to scale MU calculation (default 1) nogroupwts: false # check peptide's Protein weight against the threshold (default: check peptide's Protein Group weight against threshold) nonsp: false # do not use NSP model nooccam: false # non-conservative maximum protein list noprotlen: false # do not report protein length normprotlen: false # normalize NSP using Protein Length protmw: false # get protein mol weights softoccam: false # peptide weights are apportioned equally among proteins within each Protein Group (less conservative protein count estimate) unmapped: false # report results for UNMAPPED proteins

Label-Free Quantification: # Freequant peakTimeWindow: 0.4 # specify the time windows for the peak (minute) (default 0.4) retentionTimeWindow: 3 # specify the retention time window for xic (minute) (default 3) tolerance: 10 # m/z tolerance in ppm (default 10) raw: false # read raw files instead of converted mzML, or mzXML faims: false # use FAIMS information for the quantification

Isobaric Quantification: # Labelquant bestPSM: true # select the best PSMs for protein quantification level: 2 # ms level for the quantification minProb: 0.7 # only use PSMs with a minimum probability score plex: 10 # number of channels purity: 0.5 # ion purity threshold (default 0.5) removeLow: 0.05 # ignore the lower 3% PSMs based on their summed abundances tolerance: 20 # m/z tolerance in ppm (default 20) uniqueOnly: true # report quantification based on only unique peptides brand: tmt # isobaric labeling brand (tmt, itraq) raw: false # read raw files instead of converted mzML, or mzXML

Bio Cluster Quantification: # BioQuant organismUniProtID: # UniProt proteome ID level: 0.9 # cluster identity level (default 0.9)

FDR Filtering: # Filter psmFDR: 0.01 # psm FDR level (default 0.01) peptideFDR: 0.01 # peptide FDR level (default 0.01) ionFDR: 0.01 # peptide ion FDR level (default 0.01) proteinFDR: 0.99 # protein FDR level (default 0.01) peptideProbability: 0.7 # top peptide probability threshold for the FDR filtering (default 0.7) proteinProbability: 0.5 # protein probability threshold for the FDR filtering (not used with the razor algorithm) (default 0.5) peptideWeight: 1 # threshold for defining peptide uniqueness (default 1) razor: true # use razor peptides for protein FDR scoring picked: true # apply the picked FDR algorithm before the protein scoring mapMods: false # map modifications acquired by an open search models: true # print model distribution sequential: false # alternative algorithm that estimates FDR using both filtered PSM and Protein lists

Individual Reports: # Report msstats: false # create an output compatible to MSstats withDecoys: false # add decoy observations to reports mzID: false # create a mzID output

Integrated Reports: # Abacus protein: true # global level protein report peptide: true # global level peptide report proteinProbability: 0.9 # minimum protein probability (default 0.9) peptideProbability: 0.5 # minimum peptide probability (default 0.5) uniqueOnly: false # report TMT quantification based on only unique peptides reprint: false # create abacus reports using the Reprint format

Integrated Isobaric Quantification: # TMT-Integrator v3.2.0 path: # path to TMT-Integrator jar memory: 6 # memory allocation, in Gb output: # the location of output files channel_num: 10 # number of channels in the multiplex (e.g. 10, 11) ref_tag: Bridge # unique tag for identifying the reference channel (Bridge sample added to each multiplex) groupby: -1 # level of data summarization(0: PSM aggregation to the gene level; 1: protein; 2: peptide sequence; 3: PTM site; -1: generate reports at all levels) psm_norm: false # perform additional retention time-based normalization at the PSM level outlier_removal: true # perform outlier removal prot_norm: -1 # normalization (0: None; 1: MD (median centering); 2: GN (median centering + variance scaling); -1: generate reports with all normalization options) min_pep_prob: 0.9 # minimum PSM probability threshold (in addition to FDR-based filtering by Philosopher) min_purity: 0.5 # ion purity score threshold min_percent: 0.05 # remove low intensity PSMs (e.g. value of 0.05 indicates removal of PSMs with the summed TMT reporter ions intensity in the lowest 5% of all PSMs) unique_pep: false # allow PSMs with unique peptides only (if true) or unique plus razor peptides (if false), as classified by Philosopher and defined in PSM.tsv files unique_gene: 0 # additional, gene-level uniqueness filter (0: allow all PSMs; 1: remove PSMs mapping to more than one GENE with evidence of expression in the dataset; 2:remove all PSMs mapping to more than one GENE in the fasta file) best_psm: true # keep the best PSM only (highest summed TMT intensity) among all redundant PSMs within the same LC-MS run prot_exclude: none # exclude proteins with specified tags at the beginning of the accession number (e.g. none: no exclusion; sp|,tr| : exclude protein with sp| or tr|) allow_overlabel: true # allow PSMs with TMT on S (when overlabeling on S was allowed in the database search) allow_unlabeled: true # allow PSMs without TMT tag or acetylation on the peptide n-terminus mod_tag: none # PTM info for generation of PTM-specific reports (none: for Global data; S[167],T[181],Y[243]: for Phospho; K[170]: for K-Acetyl) min_site_prob: -1 # site localization confidence threshold (-1: for Global; 0: as determined by the search engine; above 0 (e.g. 0.75): PTMProphet probability, to be used with phosphorylation only) ms1_int: true # use MS1 precursor ion intensity (if true) or MS2 summed TMT reporter ion intensity (if false) as part of the reference sample abundance estimation top3_pep: true # use top 3 most intense peptide ions as part of the reference sample abundance estimation print_RefInt: false # print individual reference sample abundance estimates for each multiplex in the final reports (in addition to the combined reference sample abundance estimate) add_Ref: -1 # add an artificial reference channel if there is no reference channel (-1: don't add the reference; 0: use summation as the reference; 1: use average as the reference; 2: use median as the reference) max_pep_prob_thres: 0 # the threshold for maximum peptide probability min_ntt: 0 # minimum allowed number of enzymatic termini aggregation_method: 0 # the aggregation method from the PSM level to the specified level (0: median, 1: weighted-ratio)

How was this issue previously solved?

Thanks!