Closed jon-ault closed 7 months ago
Also, if there's a problem downloading a new GPU work unit & the client has to make multiple requests to get one, the CPU work unit gets interrupted on each server request.
What's happening is that the CPU WU is adjusting to take up or give back the extra CPU that the GPU WU uses.
This should be fixed.
On my Windows 11 machine running 8.1.16, when a GPU work unit completes the CPU work unit gets interrupted & restarts. In the following log snippet, WU922 is a GPU WU that completes, and WU920 is the CPU WU that gets interrupted.
Log showing restart
``` 07:33:30:I1::WU922:Completed 2500000 out of 2500000 steps (100%) 07:33:30:I1::WU922:Average performance: 83.8835 ns/day 07:33:30:I1::WU922:Checkpoint completed at step 2500000 07:33:33:I1::WU922:Saving result file ..\logfile_01.txt 07:33:33:I1::WU922:Saving result file checkpointIntegrator.xml 07:33:33:I1::WU922:Saving result file checkpointState.xml 07:33:33:I1::WU922:Saving result file positions.xtc 07:33:33:I1::WU922:Saving result file science.log 07:33:33:I1::WU922:Saving result file xtcAtoms.csv.bz2 07:33:33:I1::WU922:Folding@home Core Shutdown: FINISHED_UNIT 07:33:34:I1::WU922:Core returned FINISHED_UNIT (100) 07:33:35:I1::Added new work unit: cpus:0 gpus:gpu:02:00:00 07:33:35:I1::WU926:Requesting WU assignment for user Jon_Ault team 35054 07:33:35:I1:OUT23:> POST https://assign1.foldingathome.org/api/assign HTTP/1.1 07:33:35:I3:Connecting to assign1.foldingathome.org:443 07:33:35:I1::WU922:Uploading WU results 07:33:35:I1::WU920:WARNING:Console control signal 1 on PID 2940 07:33:35:I1::WU920:Exiting, please wait. . . 07:33:35:I1:OUT24:> POST https://vav19.fah.temple.edu/api/results HTTP/1.1 07:33:35:I3:Connecting to vav19.fah.temple.edu:443 07:33:35:I1:OUT23:< assign1.foldingathome.org:443 HTTP/1.1 200 HTTP_OK 07:33:35:I1::WU926:Received WU assignment EYMQte3vhl9fHqBlTEONqLu9GlAgaxbsmW6yWHtoSrw 07:33:35:I1::WU926:Downloading WU 07:33:35:I1:OUT25:> POST https://ds03.scs.illinois.edu/api/assign HTTP/1.1 07:33:35:I3:Connecting to ds03.scs.illinois.edu:443 07:33:35:I1::WU920:Folding@home Core Shutdown: INTERRUPTED 07:33:36:I1::WU920:Core returned INTERRUPTED (102) 07:33:36:I3::WU920:Running FahCore: C:\ProgramData\FAHClient\cores/fahcore-a8-win-64bit-avx2_256-0.0.12/FahCore_a8.exe -dir wmJyf7aUkCkJuXiFRGgKqUA42ncVRtkAiAnOXnmIMmY -suffix 01 -version 8.1.16 -lifeline 2404 -np 6 07:33:36:I3::WU920:Started FahCore on PID 19696 07:33:37:I1::WU920:*********************** Log Started 2023-03-10T07:33:36Z *********************** 07:33:37:I1::WU920:************************** Gromacs Folding@home Core *************************** 07:33:37:I1::WU920: Core: Gromacs 07:33:37:I1::WU920: Type: 0xa8 07:33:37:I1::WU920: Version: 0.0.12 07:33:37:I1::WU920: Author: Joseph Coffland