Balanced classifiers, and accuracy at higher temperatures

ievapudz / TemStaPro

TemStaPro - a program for protein thermostability prediction using sequence representations from a protein language model.

MIT License

46 stars 9 forks source link

Hello, thanks for the feedback about the work!

Regarding balanced training sets (TemStaPro-Major), we did not train our final classifiers with balanced sets, since we intended to include more data points in the training process.

I am not sure I understand what is meant by 'training test sets'. All data sets that were used to train, validate, and test the classifiers were uploaded to Zenodo system. If something still seems to be missing, please do let me know.

We do have accuracy metrics for the classifiers of upper thresholds - they were computed recently, next week the preprint in "bioRxiv" should be updated with the new scores.

ievapudz / TemStaPro

Balanced classifiers, and accuracy at higher temperatures #7