Open LoayJabre opened 1 year ago
If you are running Msgfplus with internal indexing, you need to pass the database with targets and decoys i.e. revcat and not adjusted.
If you are letting msgfplus add decoys, which is pretty much untested, you need to disable indexing. This is a bug in the indexing code @timosachsenberg
Thanks so much for the help!
Just to double check that I'm understanding you correctly:
If I have a database that already contains targets+decoys, it would be correct to use :
MSGFPlusAdapter -in $FILE -executable /software/MSGFPlus-022.04.18/MSGFPlus.jar -database ../sonja_data/BB40_protein_coding_genome_with_ft_ID_with_crap_nonredun.revCat.fasta -out $temp_string'.idXML' -add_decoys 'false' -fixed_modifications 'Carbamidomethyl (C)' -threads 12 -java_memory 50000
where the database has a specific path?
I'm puzzled because I've run the same script with '-database $db_string_adjusted' multiple times in the past, and it worked fine. It stopped working when I upgraded to OpenMS v2.8.
I'm also not really sure how to disable indexing
Passing any path is always fine. Should not matter if via a variable or not. It matters if this file already contains decoys or not.
Auto indexing was recently added per default, so that changed to previous versions.
Yes this command would probably correct if you had decoys in the fasta. You do not need any subsequent PeptideIndexer.
Run the tool with --helphelp and you will see all possible options including how to disable indexing.
Thank you!! It works perfectly now.
Good to hear. I keep it open for @timosachsenberg to fix the behaviour when add_decoys is active.
There are two ways to fix this:
I would prefer 1 because it is easier to implement and ensures that decoy databases are the same between e.g., different search engines
Hi - I'm running MSGF+ through OpenMS and I'm facing a persistent error that No peptides were matched to decoys in my database. When I run my script, a revCat.fasta file is generated, and when I inspect it manually, it contains decoys prefixed with 'XXX_'
My script is as follows:
I've also attached the .out file showing the error: database-searching-openmsv1_20221213.txt
When I run the same script but direct the -database to where the revCat.fasta file is found, i.e.
MSGFPlusAdapter -in $FILE -executable /software/MSGFPlus-022.04.18/MSGFPlus.jar -database ../sonja_data/BB40_protein_coding_genome_with_ft_ID_with_crap_nonredun.revCat.fasta -out $temp_string'.idXML' -add_decoys 'false' -fixed_modifications 'Carbamidomethyl (C)' -threads 12 -java_memory 50000
the DB searching works fine. Could this be an issue where there's a confusion in where the script is looking for the .revCatfasta file?I'm not very strong with coding, so any help would be appreciated!