Open flowers9 opened 1 year ago
~I assume this issue refers to extracting embeddings and/or predicting structures?~ I see now that this refers to esm/scripts/esmfold_inference.py.
This proposal makes sense; we'd welcome a PR!
EDIT: I think what you want to achieve could simply be name = header.split()[0]
Sometimes fasta files have simple header lines, but sometimes they toss in a bunch of stuff, for example:
This is not a good name for the output file, and it can make the logs a bit wordy, too. I'd suggest trimming it down to the first whitespace-delimited word. In esm/scripts/esmfold_inference.py:
And then use {name} instead of {header} as appropriate below.
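For anyone picking this up, here is a minimal sketch of the trimming idea. It is not the actual code from esmfold_inference.py: the helper name `short_name`, the `fallback` parameter, and the sample header are all made up for illustration. It also guards against an empty header, where a bare `header.split()[0]` would raise an `IndexError`.

```python
def short_name(header: str, fallback: str = "sequence") -> str:
    """Return the first whitespace-delimited word of a FASTA header.

    Hypothetical helper, not part of the esm repository.
    """
    # Drop a leading ">" in case the caller passes the raw header line,
    # then split on any run of whitespace (spaces or tabs).
    words = header.lstrip(">").split()
    # An empty or whitespace-only header has no words, so use the fallback.
    return words[0] if words else fallback


# Made-up header with extra description fields after the identifier.
header = "seq1 some long description length=120"
name = short_name(header)
print(f"{name}.pdb")
```

The resulting `name` is what would go into the output filename and the log messages in place of the full `header`.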