compomics / searchgui

Highly adaptable common interface for proteomics search and de novo engines
http://compomics.github.io/projects/searchgui.html
42 stars 15 forks source link

Fasta Input With Wildcards Peptide Shaker #337

Closed kostrouc closed 2 years ago

kostrouc commented 2 years ago

Hi,

I am trying to search a fasta file in Peptide Shaker that contains wildcard characters in the sequence. Is there a recommended format that these gaps in sequences should be changed into? Attached is the format of the fasta file I am trying to search in Peptide Shaker and the error I recieve.

fastaexample ps_error
hbarsnes commented 2 years ago

If all of the wildcard characters are at the end of your protein sequences, you can probably simply remove them? As they would have no impact on the list of possible peptide sequences anyway?

If they however are inside your protein sequences (don't see any examples of this above though?), you could either remove them there as well (although this would change the interpretation of the wildcards), or you will have to modify and split up the protein sequences before giving the FASTA file to SearchGUI/PeptideShaker.

kostrouc commented 2 years ago

Thank you!

mvaudel commented 2 years ago

Hi, Wildcards should be denoted with an 'X'. Stars in sequences are often used to represent stop codons between exons. Can this be the case here? Hope it helps, Marc