CongLabCode / DPAM

A domain parser for Alphafold models
32 stars 8 forks source link

Running DPAM on a very large input - stuck on step 3 #13

Open maiskhoury opened 9 months ago

maiskhoury commented 9 months ago

Hi CongLab!, I am curently trying to use you'r tool! I tried at first using it on a test directory with 15 proteins, I provided the json and pdb files as needed and it worked as it should. now I am trying to run it on a very large input 20200 proteins and I keep getting stuck at step 3. Is there a limit to the test size ? or is there any other issues with running dpam on such a large test? I keep running on this error (taken from the docker log file): "Traceback (most recent call last): File "/opt/DPAM/scripts/run_step3.py", line 100, in time.sleep(1) NameError: name 'time' is not defined"

and before gettig this error I get:

step 1 and step 2 are done as they should I got the log files with done.

thankyou!

jzhan6 commented 8 months ago

Apologies for the delayed response. It seems the error might have occurred because hhsearch did not complete for one of the queries. Identifying the specific query that caused hhsearch to fail would be beneficial in resolving this issue.

Best, Jing Zhang

maiskhoury commented 8 months ago

Hi, I got back to the same problem after succeeding a couple of times. I am still not knowing why I keep having this problem. the last run that I tried and got stuck at step3 was trying to run DPAM on 100 proteins. step1 and step2 were done step 3 got stuck on the first protein of the list (as I saw on the log file). this is what I got:

"12:01:02.395 INFO: Search results will be written to step1/RBP_sub_1/A0AV96.hhr

Killed Traceback (most recent call last): File "/opt/DPAM/scripts/run_step3.py", line 100, in time.sleep(1) NameError: name 'time' is not defined " what could be the problem with the hhsearch? thank you!