compomics / searchgui

Highly adaptable common interface for proteomics search and de novo engines
http://compomics.github.io/projects/searchgui.html
38 stars 16 forks source link

Fasta Input With Wildcards Peptide Shaker #337

Closed kostrouc closed 1 year ago

kostrouc commented 1 year ago

Hi,

I am trying to search a fasta file in Peptide Shaker that contains wildcard characters in the sequence. Is there a recommended format that these gaps in sequences should be changed into? Attached is the format of the fasta file I am trying to search in Peptide Shaker and the error I recieve.

fastaexample ps_error
hbarsnes commented 1 year ago

If all of the wildcard characters are at the end of your protein sequences, you can probably simply remove them? As they would have no impact on the list of possible peptide sequences anyway?

If they however are inside your protein sequences (don't see any examples of this above though?), you could either remove them there as well (although this would change the interpretation of the wildcards), or you will have to modify and split up the protein sequences before giving the FASTA file to SearchGUI/PeptideShaker.

kostrouc commented 1 year ago

Thank you!

mvaudel commented 1 year ago

Hi, Wildcards should be denoted with an 'X'. Stars in sequences are often used to represent stop codons between exons. Can this be the case here? Hope it helps, Marc