Closed KlemensFroehlich closed 6 months ago
@KlemensFroehlich, for me it works with your file, did you select FragPipe in the dropdown menu where you uploaded the file?
Also from the error message it seems to me that you selected maxquant in this dropdown menu?
@KlemensFroehlich
KeyError: "The following 'value_vars' are not present in the DataFrame: ['B_1 Intensity', 'B_2 Intensity', 'B_3 Intensity']"
We expect to see columns B_1 B_2 and B_3 in the tsv. You have B_4, B_5, B_6.
It probably has something to-do with the experimental design setup in FragPipe. Did you name the bio/tech reps 1,2,3,4,5,6, which btw is correct since this is not a paired analysis? Proteobench, however, expects 1,2,3,1,2,3.
Good you saw this @KlemensFroehlich There are two options:
@RobbinBouwmeester I definitely selected FragPipe... the error is reproducible on my system.
ah sorry @wolski you are completely right. Now it works for me when setting the replicates to 123123. I am wodnwering how it can work for @RobbinBouwmeester?
If we are opening up the discussion on the headers of the columns:
The most natural way for me to process MS1 quant files in FragPipe would be the "By file name" asignment of the Experiment name. That is at least what I always do.
Would it be possible to include multiple options how columns can be named or would you prefer to keep it as narrowly defined as possible?
Best Klemens
@KlemensFroehlich, not sure... Hmmm now to think of it... I first executed a test on MQ to see if nothing else was broken. As a second file I uploaded your provided file. Maybe that explains it?
Easiest way to allow for these things would be to change the .toml that would mean there are multiple entries in the dropdown menu for FragPipe though. Could be a good solution if fixing column names is difficult, if it is very easy to fix column names in MSFragger, I would prefer fixing column names that.
@KlemensFroehlich
I fully agree: "The most natural way for me to process MS1 quant files in FragPipe would be the "By file name" asignment of the Experiment name. That is at least what I always do." Actually, I would do the same when using MaxQuant (can't remember how the option to use file names as sample names is called there.) But this makes setting up and running the software much easier.
Can you send me please a FragPipe result generated with this setting, please? @RobbinBouwmeester Potentially, we can change the default and, deprecate the others.
@KlemensFroehlich further testing I cannot replicate my earlier results, I think I simply selected the wrong file when it did work for me. Indeed, the file that you uploaded does not work.
@wolski Yes, I agree with you and Klemens, we can easily change the .toml. That would mean it looks more like, e.g., the maxquant.toml. Is it easy to describe the procedure of "By file name" assignment to users?
Good stuff. For the instructions, you can find it here (https://proteobench.readthedocs.io/en/latest/modules/3-DDA-Quantification-ion-level/) in the "FragPipe" section. You can either send me what to add/change, or you can directly change the corresponding file (https://github.com/Proteobench/ProteoBench/blob/main/docs/modules/3-DDA-Quantification-ion-level.md) :)
@wolski https://drive.switch.ch/index.php/s/zYwPh9HNCd2MrUZ This is with all the standard parameters (20 ppm initial search etc) of FragPipe but the format should be correct. Let me know if I can do anything else.
@RobbinBouwmeester as for the users:
we could just replace step 2: "Assign experiments in the workflow tab corresponding with the corresponding experimental condition(“A”, “B”)." with: "Following import of raw files (or mzml?), assign experiments "by File Name" right above the list of raw files
btw: Do you want to also provide the converted mzML (or mzXML, htrms, dia, ....) files for users? I dont know whether this can be a source of variance? If the procedure how some programs convert raw files to specific input format changes this might be a confounder. But I honestly think this SHOULD not be the case :D
Best, Klemens
btw: Do you want to also provide the converted mzML (or mzXML, htrms, dia, ....) files for users? I dont know whether this can be a source of variance? If the procedure how some programs convert raw files to specific input format changes this might be a confounder. But I honestly think this SHOULD not be the case :D
We discussed this and we will ignore it for this module. If we wanted to benchmark this step, we would need a dedicated module.
Good catch @wolski!!
I have amended the instructions in the documentation in PR #188
hi everyone
I just tried to upload a MSFragger results file. I followed the instructions provided in the "How to use" section for the search.
When I upload the example MSFragger output everything seems to work. I get a table, and histograms etc. When I upload my own table, I get the error message:
which brings me to the question: Are the column hardcoded in the fragpipe output?
My output seems to look different than the example output.
example output columns:
my output columns:
This happens in FragPipe 19 and 20 for me.... maybe a different philosopher version of a different MS1 quant module version number or I specified something incorrectly during the search.
Please find attached the combined_ion output.
combined_ion_FP19.zip
Best Klemens