sokrypton / ColabFold

Making Protein folding accessible to all!
MIT License
2.01k stars 504 forks source link

ValueError: attempt to get argmax of an empty sequence #269

Open melyssacheung opened 2 years ago

melyssacheung commented 2 years ago

Expected Behavior

trying to get produced relaxed models of tcr-pmhc complexes running on the terminal (prior to the update it worked)

Current Behavior

In the fasta file contains the sequences of mhc:peptide:tcr-alpha:linker:tcr-beta (in that respective order separated by ':') In the script, the only thing I changed is the max template date to 2018-04-30 instead of 2100-01-01 as that is my cutoff as AlphaFold is trained on protein chains in the PDB released before 2018-04-30.

Steps to Reproduce (for bugs)

/piercehome/cheungm/colabfold_batch/batch.py --amber --templates --num-recycle 3 --use-gpu-relax /piercehome/cheungm/FASTA_linker/Class_1/6l9l_linker.fasta /piercehome/cheungm/colabfold_batch/output_new/

ColabFold Output (for bugs)

2022-07-14 01:52:35,852 Running colabfold 1.3.0 (d9adee4372a543fc3ecd104476f5da476dc85a3a) 2022-07-14 01:52:37,498 --max-msa can not be used in combination with AlphaFold2-multimer (--max-msa ignored) WARNING: You are welcome to use the default MSA server, however keep in mind that it's a limited shared resource only capable of processing a few thousand MSAs per day. Please submit jobs only from a single IP address. We reserve the right to limit access to the server case-by-case when usage exceeds fair use.

If you require more MSAs:

Context

originally i got the same error in the now closed regarding the release date error (https://github.com/sokrypton/ColabFold/issues/267), however, after reinstalling colabfold on my cluster, leads to this new error which im unable to bypass.

Your Environment

prior to running i use a different cluster (that has more gpu's), module load gcc/cuda, and conda activate colabfold_conda

thank you in advance!

milot-mirdita commented 2 years ago

Can you share the sequence? Especially the linker sequence, since that seems to be causing something weird in the following line:

2022-07-14 01:54:39,413 Sequence 3 found templates: []

We updated the databases yesterday and are still ironing out some remaining issues.

milot-mirdita commented 2 years ago

The latest commit should fix your issue

melyssacheung commented 2 years ago

my 6l9l_linker.fasta contains:

6l9l GPHSMRYYETATSRRGLGEPRYTSVGYVDDKEFVRFDSDAENPRYEPQVPWMEQEGPEYWERITQIAKGQEQWFRVNLRTLLGYYNQSAGGTHTLQRMYGCDVGSDGRLLRGYEQFAYDGCDYIALNEDLRTWTAADMAAQITRRKWEQAGAAEYYRAYLEGECVEWLHRYLKNG:SPSYAYHQF:KTTQPDSMESTEGETVHLPCSHATISGNEYIYWYRQVPLQGPEYVTHGLQQNTTNSMAFLAIASDRKSSTLILTHVSLRDAAVYHCILQGTGSKLSFGKGAKLTVS:GSADDAKKDAAKKDGKS:VTQSPRNKVTVTGGNVTLSCRQTNSHNYMYWYRQDTGHGLRLIHYSYGAGNLQIGDVPDGYKATRTTQEDFFLLLELASPSQTSLYFCASSDGDYEQYFGPGTRLTVL

melyssacheung commented 2 years ago

unfortunately i still get the same error when I run it

melyssacheung commented 2 years ago

I just attempted to remove the linker sequence and I still obtain the same error:

2022-07-14 08:38:26,497 Running colabfold 1.3.0 (d9adee4372a543fc3ecd104476f5da476dc85a3a) 2022-07-14 08:38:27,440 --max-msa can not be used in combination with AlphaFold2-multimer (--max-msa ignored) WARNING: You are welcome to use the default MSA server, however keep in mind that it's a limited shared resource only capable of processing a few thousand MSAs per day. Please submit jobs only from a single IP address. We reserve the right to limit access to the server case-by-case when usage exceeds fair use.

If you require more MSAs:

milot-mirdita commented 2 years ago

Can you update to the latest git version? That once includes the fix for your query.

melyssacheung commented 2 years ago

just reinstalled it and it worked, thanks so much!