ZhonghuiGu / HEAL

41 stars 5 forks source link

esm1b only supports input sequence lengths up to 1024, how to handle proteins with sequence lengths greater than 1024 #8

Open MiJia-ID opened 8 months ago

MiJia-ID commented 8 months ago

Hi author, we are trying to build our own dataset but found that esm1b only supports input sequence length up to 1024, we would like to ask how you guys deal with protein sequences with length greater than 1024 when building the dataset?

ZhonghuiGu commented 8 months ago

Hello. The dataset built by DeepFRI does not contain sequences longer than 1000.