Closed. Sugar1y closed this issue 2 years ago.
I am assuming you are running run_VIGOR.sh. This seems to work well as it runs for a whole epoch and the evaluation result makes sense. Could you give me more details about what happens next? I did not have this kind of issue before.
Hi, could you provide more information about this issue? Which script did you run? Does it work now? If there is no response, I will assume it has been resolved and close this issue.
Thank you for your reply.
I tried your code, but after one iteration it got stuck in the second iteration. There was no error, and the process was still running in the background. Was the process locked?
Re: [Jeff-Zilence/TransGeo2022] get stuck after one training iteration (Issue #3), Sat, Jun 18, 2022 01:48 AM
Could you please tell me what command you used? It is hard to figure out the issue given the limited information. Note that this is one epoch, not one iteration. You may kill the process and try again.
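If it hangs again, one way to collect more information is to have Python dump the stack of every thread when a training step takes too long. The following is only a minimal sketch using the standard-library `faulthandler` module. The names `watch_for_hang`, `train_one_epoch`, and `step_fn` are hypothetical placeholders and are not part of the TransGeo2022 code, and the 600-second timeout is an arbitrary assumption.

```python
import faulthandler
import sys


def watch_for_hang(timeout_s=600.0, stream=sys.stderr):
    """Arm a watchdog that dumps all thread stacks to `stream` if it is
    not re-armed or cancelled within `timeout_s` seconds."""
    # Calling dump_traceback_later again replaces the previous timer.
    faulthandler.dump_traceback_later(timeout_s, repeat=True, file=stream)


def train_one_epoch(steps, step_fn, timeout_s=600.0, stream=sys.stderr):
    """Hypothetical training loop: `step_fn(i)` stands in for one iteration
    (load a batch, forward, backward, optimizer step)."""
    done = 0
    try:
        for i in range(steps):
            watch_for_hang(timeout_s, stream)  # re-arm before every step
            step_fn(i)
            done += 1
    finally:
        # Disarm the watchdog once the loop ends, normally or with an error.
        faulthandler.cancel_dump_traceback_later()
    return done
```

If a step stalls, the dump shows which line each thread is blocked on, and posting that output here would make the problem much easier to narrow down.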
Thank you for your amazing work. I had a problem when trying the code: the network model seemed to get stuck after one training iteration. Has the author encountered this problem? The program did not report an error, and the process is still running and occupying memory.