Closed AlexKurek closed 1 year ago
Thanks for reporting. I've added rclone to the recipe for the next release.
Later there is another crash, but I dont know why it happens:
Symlink ./SOLSDIR/L658346_SB001_uv_avg_12C2BC993t_129MHz.pre-cal.ms/killMS.DDS3_full_smoothed.sols.npz already exists, recreating
Successful readonly open of default-locked table L658346_SB001_uv_avg_12C2BC993t_121MHz.pre-cal.ms/OBSERVATION: 31 columns, 1 rows
../4C29.30.ds9.reg
[130.00975000deg,29.81742500deg]
Correcting boxfile for the local north
Using these observations ['L658346']
Traceback (most recent call last):
File "/opt/lofar/ddf-pipeline/scripts/sub-sources-outside-region.py", line 585, in <module>
DOut=SummaryToVersion("summary.txt")
File "/opt/lofar/ddf-pipeline/scripts/sub-sources-outside-region.py", line 574, in SummaryToVersion
l=L[iLine]
IndexError: list index out of range
- 21:09:21 - ClearSHM | Clear shared memory
- 21:09:21 - Multiprocessing | reaping 70 shared memory objects associated with 70 dead DDFacet processes
- 21:09:21 - ClearSHM | Clear Semaphores
- 21:09:21 - ClearSHM | Clear shared dictionaries
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB404_uv.pre-cal_12D524E44t_154MHz.pre-cal.ms 0.2452661179385455
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB413_uv.pre-cal_12D524E44t_156MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB413_uv.pre-cal_12D524E44t_156MHz.pre-cal.ms 0.26963945066386985
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB423_uv.pre-cal_12D524E44t_158MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB423_uv.pre-cal_12D524E44t_158MHz.pre-cal.ms 0.2744160681578087
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB432_uv.pre-cal_12D524E44t_160MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB432_uv.pre-cal_12D524E44t_160MHz.pre-cal.ms 0.17283014508614727
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB442_uv.pre-cal_12D524E44t_162MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB442_uv.pre-cal_12D524E44t_162MHz.pre-cal.ms 0.19027707730074406
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB452_uv.pre-cal_12D524E44t_164MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB452_uv.pre-cal_12D524E44t_164MHz.pre-cal.ms 0.24146511168365292
Successful readonly open of default-locked table /storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB461_uv.pre-cal_12D524E44t_166MHz.pre-cal.ms: 25 columns, 6578850 rows
/storage/akurek/extractPy/4C29.30_timeavg1/4C29.30/P129+29/L693959_SB461_uv.pre-cal_12D524E44t_166MHz.pre-cal.ms 0.5271044103452731
[91m============================= Running subtraction =============================[0m
[92mRunning: sub-sources-outside-region.py --timeavg=1 --overwriteoutput --ncpu=28 -b ../4C29.30.ds9.reg -p 4C29.30[0m
[91mFAILED to run sub-sources-outside-region.py --timeavg=1 --overwriteoutput --ncpu=28 -b ../4C29.30.ds9.reg -p 4C29.30: return value is 1[0m
Traceback (most recent call last):
File "/opt/lofar/ddf-pipeline/scripts/extraction.py", line 116, in <module>
run(executionstr,database=False)
File "/opt/lofar/ddf-pipeline/utils/auxcodes.py", line 68, in run
die('FAILED to run '+s+': return value is '+str(retval),database=database)
File "/opt/lofar/ddf-pipeline/utils/auxcodes.py", line 51, in die
raise RuntimeError(s)
RuntimeError: FAILED to run sub-sources-outside-region.py --timeavg=1 --overwriteoutput --ncpu=28 -b ../4C29.30.ds9.reg -p 4C29.30: return value is 1
rclone
has been added in the latest release.
Using
lofar_sksp_v4.1.0_x86-64_generic_ddf_cuda.sif
Im getting:It seems easy to fix by adding
rclone
package to the container