Closed kimrutherford closed 6 years ago
Maybe it would be good to ask Steve (or someone else at biochemistry) which resource gives the best estimates? Some methods outperform other methods, but I don't know which one is regarded best/reliable.
Ask pombelist which one people would prefer ?
@Antonialock maybe you needed to provide the options?
Also asked Steve's group
I wouldn’t know whether to suggest tools or pick algorithms recommended in papers.
Q: do general users even look at the number? Q2: would anyone who cares about the number go out themselves and find an estimate based on their favourite algorithm?
Let's just pick one, (top google hit?)
what are the options, we'll decide on the call
This paper compares different prediction methods: https://academic.oup.com/bioinformatics/article/32/6/821/1744386
They recommend an SVM based machine learning algorithm published here: https://www.ncbi.nlm.nih.gov/pubmed/22326964
Pride
document method used currently.
Documentation done. We decided that will do until & unless users clamor for something different.
While working on #569 I noticed that the protein properties we show on the gene pages doesn't match the values in the PeptideStats.tsv file from PomBase V1.
There are many calculators on the web but they give quite different answers for some of the properties. So I'm not sure our values are OK.
As an example, for SPAC3G9.09c we have:
(Calculated using this Perl module: http://search.cpan.org/dist/BioPerl/Bio/Tools/pICalculator.pm )
This online calculator matches the values we have quite well (but not perfectly): http://protcalc.sourceforge.net/
This one gives somewhat different values: http://pepcalc.com/protein-calculator.php
This one: http://isoelectric.ovh.org/ gives the iso-electric point as: 5.83