MannLabs / alphapeptdeep

Deep learning framework for proteomics
Apache License 2.0
105 stars 20 forks source link

Redundant protein names for one precursor entry in the spectral library #53

Closed jalew188 closed 1 year ago

jalew188 commented 2 years ago

If a protein sequence contains a same peptide at different positions, the peptide/precursor entry will include the redundant protein ids/names. For example:

prot_1 ABCKABCK prot_2 ABCK

The 'ABCK' entry will have protein_idxes '0;0;1' and protein names 'prot_1;prot1;prot_2'. The correct ones should be '0;1' and 'prot_1;prot2'.

This has been fixed in the development branch of alphabase: 6f8f333f1aa6e2c661b8b58ab65845a278ea75c1. And it will be released soon for pypi and installers.

jalew188 commented 1 year ago

Solved in alphabase-0.1.4 and alphabase main branch