Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.
Would you be able to add licenses for the datasets?
In particular, the pre-trained models may themselves require licenses, depending on the dataset(s) they were trained on.
The models are trained on Pfam, which is released under LGPL. I'm not sure about the other datasets. Academically, the appropriate thing to do is to cite the corresponding paper for each dataset.