GeoscienceAustralia / eqrm

Automatically exported from code.google.com/p/eqrm
Other
5 stars 4 forks source link

Scenario Ground Motion simulation crashing when run in parallel #67

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Scenario Ground Motion simulation crashing when run in parallel

On node 8, rhe-compute1.ga.gov.au wall time taken overall 0:00:14.527527 
hr:min:                 sec.
wall_time_taken_overall_seconds = 14.5275270939

Traceback (most recent call last):
  File "ScenGM_controlfile.py", line 83, in <module>
    main(locals())
  File "/nas/mnh/georisk_models/earthquake/sandpits/drobinson/eqrm_google/trunk/                 eqrm_core/eqrm_code/analysis.py", line 732, in main
    block_indices)
  File "/nas/mnh/georisk_models/earthquake/sandpits/drobinson/eqrm_google/trunk/                 eqrm_core/eqrm_code/output_manager.py", line 1335, in join_parallel_data_files
    name = base_name + FILE_TAG_DELIMITER + str(i) + '.npy'
TypeError: can only concatenate list (not "str") to list

--------------------------------------------------------------------------
mpirun has exited due to process rank 0 with PID 30703 on
node rhe-compute1.ga.gov.au exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------
drobinso@rhe-compute1:/nas/mnh/georisk_models/earthquake/EQRM_Phillippines/Scena
rioGM$ wall_time_taken_overall_seconds = 14.5275270939
TypeError: can only concatenate list (not "str") to list

--------------------------------------------------------------------------
mpirun has exited due to process rank 0 with PID 30703 on
node rhe-compute1.ga.gov.au exiting without calling "finalize". This may
have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------

Original issue reported on code.google.com by RobinsonDavidJ0@gmail.com on 10 Aug 2012 at 4:33

GoogleCodeExporter commented 9 years ago
This can be observed by doing;
EQRM/trunk/eqrm_core/demo/plot/plot_data$ mpirun -np 2 python2.7 plot_scen_gm.py

or 
trunk/eqrm_core/demo/plot/plot_data$ mpirun -np 2 python2.7 
little_plot_scen_gm.py

This error is likely due to issue 64 not checked for running in parallel.

Original comment by duncan.g...@gmail.com on 10 Aug 2012 at 4:51

GoogleCodeExporter commented 9 years ago

Original comment by duncan.g...@gmail.com on 10 Aug 2012 at 7:17