nadavbra / protein_bert

475 stars 98 forks source link

Missing MajorPTMs train CSV file #94

Open YanjingLiLi opened 2 months ago

YanjingLiLi commented 2 months ago

Hi authors, I think the proteinBert benchmark data folder misses the train CSV for phophositePTMs (https://github.com/nadavbra/protein_bert/tree/master/protein_benchmarks). Can you help check this? Thanks.

ddofer commented 2 months ago

As I recall, it was too large to upload on github. We uploaded it elsewhere (maybe in the paper's supplementary?). REgardless, you can download the dataset's source from phosphoPTM. I think we just did a random split then .