I recently read the pepquery2 paper, I like many of the analyses you performed in the discussion to show its versatility, but I was left wondering about step4 (the shuffling of peptides).
If I understand correctly, you either make 10000 combinations, or all unique ones, in case the total number of combinations is lower than 10000. In my work I am seeing peptides with 4 or more I/L aminoacids, and after a cursory check of the source code I didn't find any code that takes their equal mass into effect for shuffling.
Is temporarily converting them to a "J" or some other character an option to rectify this for shuffling (and any other analyses where their equal mass matters. For example, does a reference match with an I instead of an L in the peptide or vice versa matter)?
If you already have any code aimed at handling this that I missed, could you link it? I would be very interested in reading it.
I recently read the pepquery2 paper, I like many of the analyses you performed in the discussion to show its versatility, but I was left wondering about step4 (the shuffling of peptides).
If I understand correctly, you either make 10000 combinations, or all unique ones, in case the total number of combinations is lower than 10000. In my work I am seeing peptides with 4 or more I/L aminoacids, and after a cursory check of the source code I didn't find any code that takes their equal mass into effect for shuffling.
Is temporarily converting them to a "J" or some other character an option to rectify this for shuffling (and any other analyses where their equal mass matters. For example, does a reference match with an I instead of an L in the peptide or vice versa matter)?
If you already have any code aimed at handling this that I missed, could you link it? I would be very interested in reading it.