OpenBioML / protein-lm-scaling

Other
55 stars 14 forks source link

Protein Gym Eval script, gitignore #5

Closed Muedi closed 1 year ago

Muedi commented 1 year ago

Hi,

as discussed, here is a basic start for the proteingym eval script :) As is it runs on a rtxx 4090 but the VRAM is full.

The script has a header with a discription.

Things to do/discuss downstream:

pascalnotin commented 1 year ago

Lgtm for the base ProteinGym supervised eval script!

jamaliki commented 1 year ago

If I may make a small change, the following are the HuggingFace Esm tokens:

['<cls>', '<pad>', '<eos>', '<unk>', 'L', 'A', 'G', 'V', 'S', 'E', 'R', 'T', 'I', 'D', 'P', 'K', 'Q', 'N', 'F', 'Y', 'M', 'H', 'W', 'C', 'X', 'B', 'U', 'Z', 'O', '.', '-', '<null_1>', '<mask>']

So we want to change "[CLS]" and "[EOS]"

Muedi commented 1 year ago

Thanks, I'll change it in the evening :)