Closed agarrubio closed 2 years ago
Thanks for flagging. This should be handled with a proper error message.
The problem here is that this fasta file does not contain an MSA, and therefore is not meant for input to MSATransformer. Things go awry in the MSABatchConverter. We should just raise an error since the whole extract.py
script is written for efficient batched computation of single-sequence language models.
NOTE: if this is not a bug report, please use the GitHub Discussions for support questions (How do I do X?), feature requests, ideas, showcasing new applications, etc.
Bug description extract.py creates strange and very long filenames
Reproduction steps
Expected behavior Filenames that do not break a linux OS
Logs Please paste the command line output:
Additional context Add any other context about the problem here. (like proxy settings, network setup, overall goals, etc.) OS: Linux Mint 20.1 pwd: /home/alejandro/Downloads/esm torch : pytorch 1.7.0 py3.8_cuda10.1.243_cudnn7.6.3_0 pytorch