snap-stanford / SATURN

MIT License
108 stars 17 forks source link

Question: protein embedding, are we only using protein sequences? #50

Closed YaoLi3 closed 7 months ago

YaoLi3 commented 7 months ago

Hello,

Thanks again for the awesome framework!

I have a question about protein embedding, are we using amino acid sequences to generate protein embeddings, or we firstly use sequences to predict protein structures then create protein embeddings?

Thanks in advance, Yao

Yanay1 commented 7 months ago

We use a protein language model (like ESM2) to generate embeddings. It takes amino acid sequences as an input and produces an embedding. It does not take in structure as an input, however, it can be used to predict structures.

YaoLi3 commented 7 months ago

Thanks a lot for your swift reply!