KosinskiLab / AlphaPulldown

https://doi.org/10.1093/bioinformatics/btac749
GNU General Public License v3.0
226 stars 49 forks source link

Generating the MSAs by Colabfold #111

Closed Pooryamb closed 1 year ago

Pooryamb commented 1 year ago

Hi,

Thanks for your amazing project. I have generated the MSA profile for a set of proteins using ColabFold and want to use your program's pulldown mode on them. I was wondering what the needed parameters are for running the ColabFold locally. More specifically, should we use env_db for MSA generation? Or do we just need to use UniRef for making the profiles?

I have generated the MSA profiles by selecting the env_db database. I got two a3m files for my proteins named:

Which a3m file should be used for downstream steps?

Thanks in advance

dingquanyu commented 1 year ago

Hi @Pooryamb,

Thanks for your interest in our package but unfortunately, how to run colabfold locally is beyond the scope of this package. Please refer to colabfold's page for detailed istructions: https://github.com/sokrypton/ColabFold

I have installed mmseqs, colaba_search, and colabfold databases locally myself then it finished with 1 a3m for each protein. I think the env_db directories are created if you choose to run mmseqs2 remotely using their server. When you run remotely, AlphaPulldown will take care of the a3m files and generate features automatically for you. When I myself run colab_search locally, I did not see these files or env_db

I hope this may be helpful.

Cheers

dingquanyu commented 1 year ago

Hi, just writing to check if this has been solved?

Pooryamb commented 1 year ago

Thanks for the follow up. I generated my MSAs using both EnvDB and UniRef. I don't know which database is used by the ColabFold by default, when it predicts PPI. Yet, as selecting the EnvDB maskes the MSAs more comprehensive, I used UniRef too. Yet, I am a little bit confused. Because ColabFold first generates the MSA by searching against UniRef, and then against EnvDB. So, I expected to see Env MSAs are larger than UniRef alignments. Yet, the file sizes tell the opposite. I have asked my question on github of ColabFold. Waiting for their reply. Link to my question As my question is more related to ColabFold, feel free to close this thread.

dingquanyu commented 1 year ago

Hi,

Really sorry I'm not familiar with the databases used by ColabFold and I couldn't help you with this here. AlphaPulldown only integrated their codes as an alternative. I hope colabfold guys will provide you with more detailed explanations.

Cheers