ESMCI / cime

Common Infrastructure for Modeling the Earth
http://esmci.github.io/cime
Other
162 stars 207 forks source link

testlist hangs when it should exit #4635

Closed jedwards4b closed 3 months ago

jedwards4b commented 6 months ago

From Cheryl Craig:

It is looking like it may be the testing infrastructure which hangs when running a list of tests. I ran two tests using the qsub command on the following .sh files:

---- SCRIPT WITH CREATE_TEST ON SINGLE TEST -----

!/bin/bash

#

PBS -N cime-tests

PBS -q main

PBS -A P93300042

PBS -l walltime=0:30:00

PBS -l select=1:ncpus=128:mpiprocs=128

PBS -j oe

cd /glade/u/home/cacraig/cam_alpha17_externals/cime/scripts ./create_test ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s

------- OUTPUT FOLLOWS -------

derecho1$ /glade/derecho/scratch/cacraig/cs.status.20240517_102527_tfwrgq 20240517_102527_tfwrgq: 1 test ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s (Overall: FAIL) details: PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s CREATE_NEWCASE PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s XML PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s SETUP FAIL ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s SHAREDLIB_BUILD failed to initialize

----- THE ABOVE JOB EXITED PROMPTLY -------

----- SCRIPT USING TESTLIST WITH SINGLE TEST IN IT -----------

!/bin/bash

#

PBS -N cime-tests

PBS -q main

PBS -A P93300042

PBS -l walltime=0:30:00

PBS -l select=1:ncpus=128:mpiprocs=128

PBS -j oe

cd /glade/u/home/cacraig/cam_alpha17_externals/cime/scripts ./create_test --xml-category test_cac1 --xml-machine derecho --retry 2 --xml-compiler intel --queue main --test-root /glade/derecho/scratch/cacraig/test_cac1_byhandsingle --output-root /glade/derecho/scratch/cacraig/test_cac1_byhandsingle --project P93300042 --test-id test_cac1_byhandsingle --compare /glade/campaign/cesm/community/amwg/cam_baselines/cam6_3_159 --generate /glade/derecho/scratch/cacraig/test_cac1_byhandsingle/baselines

------ OUTPUT FOLLOWS -------

derecho1$ cd /glade/derecho/scratch/cacraig/test_cac1_byhandsingle/ baselines/ cs.status.test_cac1_byhandsingle ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s.GC.test_cac1_byhandsingle/ testreporter derecho1$ ./cs.status.test_cac1_byhandsingle test_cac1_byhandsingle: 1 test ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s (Overall: PEND) details: PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s CREATE_NEWCASE PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s XML PASS ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s SETUP PEND ERP_Ln9.ne30_ne30_mg17.FCnudged.derecho_intel.cam-outfrq9s SHAREDLIB_BUILD

---- THE ABOVE JOB HUNG UNTIL I KILLED IT AFTER 20 MINUTES -----------

github-actions[bot] commented 3 months ago

This issue is stale because it has been open 90 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] commented 3 months ago

This issue was closed because it has been stalled for 5 days with no activity.