FoldingAtHome / fah-issues

49 stars 9 forks source link

Not continuing fold after hibernation and downloading loads of new projects without folding #1704

Open Gerriton opened 1 year ago

Gerriton commented 1 year ago

name: Reporting An Issue about: Create a report about a bug in the Folding@home Software. title: '' labels: '' assignees: ''


Your issue may already be reported! Please search on the issue tracker before creating one.

Your Environment

F@H Software version:

Operating System:

Browser: not using one


Expected Behavior

When I go into power-saving-mode/PC-hibernation it should continue folding the project after waking the PC up, same as when restarting it (when it does work).


Current Behavior

After waking the PC up, the log shows an error-message

22:23:39:ERROR:Receive error: 4: Interrupted system call
22:23:39:WARNING:WU01:FS01:Detected clock skew (2 hours 02 mins), I/O delay, laptop hibernation or other slowdown noted, adjusting time estimates`
22:23:40:WU01:FS01:0x22:An exception occurred at step 996768: Error invoking kernel: CUDA_ERROR_LAUNCH_FAILED (719)
22:23:40:WU01:FS01:0x22:ERROR:98: Attempting to restart from last good checkpoint by restarting core.
22:23:40:WU01:FS01:0x22:Folding@home Core Shutdown: CORE_RESTART
22:23:46:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
22:23:46:WU01:FS01:Starting
22:23:46:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit/22-0.0.20/Core_22.fah/FahCore_22 -dir 01 -suffix 01 -version 706 -lifeline 1724 -checkpoint 30 -cuda-device 0 -gpu-vendor nvidia -gpu -1 -gpu-usage 100
22:23:47:WU01:FS01:0x22:Digital signatures verified
22:23:47:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
22:23:47:WU01:FS01:0x22:Version 0.0.20
22:23:47:WU01:FS01:0x22:  Checkpoint write interval: 25000 steps (2%) [50 total]
22:23:47:WU01:FS01:0x22:  JSON viewer frame write interval: 12500 steps (1%) [100 total]
22:23:47:WU01:FS01:0x22:  XTC frame write interval: 20000 steps (1.6%) [62 total]
22:23:47:WU01:FS01:0x22:  Global context and integrator variables write interval: disabled
22:23:47:WU01:FS01:0x22:No -opencl-device specified; using deprecated -gpu argument as an alias for -opencl-device.
22:23:47:WU01:FS01:0x22:Please consider upgrading your client version.
22:23:47:WU01:FS01:0x22:There are 3 platforms available.
22:23:47:WU01:FS01:0x22:Platform 0: Reference
22:23:47:WU01:FS01:0x22:Platform 1: CPU
22:23:47:WU01:FS01:0x22:Platform 2: CUDA
22:23:47:WU01:FS01:0x22:  cuda-device 0 specified
22:23:47:WU01:FS01:0x22:opencl-device was set but OpenCL platform could not be found.
22:23:50:WU01:FS01:0x22:Attempting to create CUDA context:
22:23:50:WU01:FS01:0x22:  Configuring platform CUDA
22:23:50:WU01:FS01:0x22:Failed to create CUDA context:
22:23:50:WU01:FS01:0x22:Error initializing CUDA: CUDA_ERROR_UNKNOWN (999) at /home/conda/feedstock_root/build_artifacts/openmm_1640541150081/work/platforms/cuda/src/CudaContext.cpp:139
22:23:50:WU01:FS01:0x22:ERROR:125: Failed to create a GPU-enabled OpenMM Context.
22:23:50:WU01:FS01:0x22:Saving result file ../logfile_01.txt
22:23:50:WU01:FS01:0x22:Saving result file science.log
22:23:50:WU01:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
22:23:50:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
22:23:50:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:18213 run:21477 clone:1 gen:21 core:0x22 unit:0x000000010000001500004725000053e5
22:23:50:WU01:FS01:Uploading 4.23KiB to 206.223.170.146
22:23:50:WU01:FS01:Connecting to 206.223.170.146:8080
22:23:51:WU01:FS01:Upload complete
22:23:51:WU01:FS01:Server responded WORK_ACK (400)
22:23:51:WU01:FS01:Cleaning up
22:23:51:WU00:FS01:Connecting to assign1.foldingathome.org:80
22:23:52:WU00:FS01:Assigned to work server 34.72.228.44
22:23:52:WU00:FS01:Requesting new work unit for slot 01: gpu:1:0 GA106 [GeForce RTX 3060 Lite Hash Rate] from 34.72.228.44
22:23:52:WU00:FS01:Connecting to 34.72.228.44:8080
22:23:52:WU00:FS01:Downloading 5.74MiB
22:23:54:WU00:FS01:Download complete
22:23:54:FS01:Paused
22:23:54:WU00:FS01:Received Unit: id:00 state:DOWNLOAD error:NO_ERROR project:18719 run:0 clone:346 gen:48 core:0x22 unit:0x0000015a000000300000491f00000000
22:23:54:WU00:FS01:Starting
22:23:54:WU00:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /var/lib/fahclient/cores/cores.foldingathome.org/lin/64bit/22-0.0.20/Core_22.fah/FahCore_22 -dir 00 -suffix 01 -version 706 -lifeline 1724 -checkpoint 30 -cuda-device 0 -gpu-vendor nvidia -gpu -1 -gpu-usage 100
22:23:54:WU00:FS01:Started FahCore on PID 172198
22:23:54:WU00:FS01:Core PID:172202
22:23:54:WU00:FS01:FahCore 0x22 started
22:23:55:FS01:Shutting core down

Possible Solution (Optional)


Steps To Reproduce

  1. start a project
  2. hibernate the pc
  3. wake it up again -> it starts downloading a new project every few seconds -> you loose your bonus because you have many faulty projects within seconds

Context