Closed Peigen-L closed 3 years ago
Hi Peigen, We have updated our gadi install scripts for the upcoming 2.11 release of underworld. Please see this example script https://github.com/underworldcode/underworld2/blob/v2.11_release/docs/install_guides/nci_gadi/gadi.sh
We also have a machine accessible installation available at /g/data/m18/codes/
on gadi.
Thanks for this Julian. Is "a machine accessible installation available" means I can directly use the sample.pbs for model running with some changes in only PBS options, foodar.py and model name?
@Peigen-L Yes you should be able to use sample.pbs
. Let me know if you have issues with it.
Please do not change any pip models in the virtualenv. If you need custom python packages then install them in your own directory space and use PYTHONPATH to reference them.
@julesghub Hi Julian. Thank you for the sharing. I have run a small testing model using the sample.pbs And on testing model I got a error message from Gadi: /local/spool/pbs/mom_priv/jobs/25158339.gadi-pbs.SC: line 12: /g/data/m18/codes/UWGeodynamics_2.10.0.sh: No such file or directory And this is the PBS I used:
#!/bin/bash
#PBS -P jq14
#PBS -q normal
#PBS -l walltime=48:00:00
#PBS -l mem=100GB
#PBS -l jobfs=100GB
#PBS -l ncpus=128
#PBS -l software=underworld
#PBS -l wd
#PBS -l storage=gdata/jq14
source /g/data/m18/codes/UWGeodynamics_2.10.0.sh
export PYTHONPATH=/home/561/lp5029/.local/lib/python3.9/site-packages:/g/data/jq14/lp5029/codes/UWGeodynamics_2.9.6/lib/python3.9/site-packages
MODELNAME="3D_testing"
OUTPUTPATH=`pwd`
SCRIPT="/scratch/jq14/lp5029/test/Upper_smaller_model.py"
export OPENBLAS_NUM_THREADS=1
# execution
mpiexec python3 ./$SCRIPT 1> $OUTPUTPATH/$MODELNAME.$PBS_JOBID.log 2> $OUTPUTPATH/$MODELNAME.$PBS_JOBID.err
And can I source the m18 folder under /g/data since I am not in the group? Please help
Regards Peigen
Change this line
#PBS -l storage=gdata/jq14
to
#PBS -l storage=gdata/m18
That will give your job's compute nodes access to the /g/data directory. You should have read/execute access to it.
After change the line to storage=gdata/m18. The error remains the same as: /local/spool/pbs/mom_priv/jobs/25162236.gadi-pbs.SC: line 12: /g/data/m18/codes/UWGeodynamics_2.10.0.sh: No such file or directory. And I can't find the folder m18 under /g/data/ Please help
Can you run
source /g/data/m18/codes/UWGeodynamics_2.10.0.sh
from the commandline?
Does the virtualenv correctly start up? You can check with pip list
, it should contain underworld
and UWGeodynamics
Sorry, I can't source this file from my side. bash: /g/data/m18/codes/UWGeodynamics_2.10.0.sh: No such file or directory
Oh I recall now, we created a group on Gadi called underworld
. And only users within that group have read/execute access.
Either we:
1) Get you into that group.
2) you can use the install scripts gadi.sh
in the github link I sent you before.
Can you added me into the group?
I'll need your gadi username.
I'll need your gadi username.
lp5029
Request for NCI admin to add you to underworld
group made.
Thanks!
Request for underworld group has been made. Please approve it. Thank you.
@julesghub Hi, Julian. Thanks for the invitation on Gadi. I have joined in the software group underworld on NCI. However, I still have trouble in sourcing the folder m18:
source /g/data/m18/codes/UWGeodynamics_2.10.0.sh
bash: /g/data/m18/codes/UWGeodynamics_2.10.0.sh: No such file or directory
This is the information I got from NCI experts:
hello luo,
i cannot see you are part of the m18 project group. please go here to join the group: https://my.nci.org.au/mancini/project/m18/join
and then let the CI of the project know so they can approve your request.
subsequently you will need to logout and login to see project m18 in your /g/data directory.
regards, javed
Should I request again to join m18: Instabilities in the convecting mantle and lithosphere?
@julesghub Hi, Julian. Thanks again for letting me in to the group m18.
I can source the files in m18 now and virtualenv start up properly like: (UWGeodynamics_2.10.2) [lp5029@gadi-login-09 test]$
By checking the pip list
I can see underworld 2.10.1b0
and UWGeodynamics 2.10.2
in the package list.
However, when I am using sample.pbs to start running the model I got some error message from Gadi like:
Traceback (most recent call last):
File "/g/data/m18/codes/UWGeodynamics_2.10.2/lib/python3.7/site-packages/UWGeodynamics/__init__.py", line 5, in <module>
import underworld
File "/home/561/lp5029/.local/lib/python3.9/site-packages/underworld/__init__.py", line 52, in <module>
import h5py as _h5py
File "/home/561/lp5029/.local/lib/python3.9/site-packages/h5py/__init__.py", line 25, in <module>
from . import _errors
ImportError: cannot import name '_errors' from 'h5py' (/home/561/lp5029/.local/lib/python3.9/site-packages/h5py/__init__.py)
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "./Upper_smaller_model.py", line 7, in <module>
import UWGeodynamics as GEO
File "/g/data/m18/codes/UWGeodynamics_2.10.2/lib/python3.7/site-packages/UWGeodynamics/__init__.py", line 8, in <module>
raise ImportError("Can not find Underworld, please check your installation")
ImportError: Can not find Underworld, please check your installation
And this is the PBS I used:
#!/bin/bash
#PBS -P jq14
#PBS -q normal
#PBS -l walltime=48:00:00
#PBS -l mem=100GB
#PBS -l jobfs=100GB
#PBS -l ncpus=128
#PBS -l software=underworld
#PBS -l wd
#PBS -l storage=gdata/jq14
source /g/data/m18/codes/UWGeodynamics_2.10.0.sh
export PYTHONPATH=/home/561/lp5029/.local/lib/python3.9/site-packages:/g/data/jq14/lp5029/codes/UWGeodynamics_2.9.6/lib/python3.9/site-packages
MODELNAME="3D_testing"
OUTPUTPATH=`pwd`
SCRIPT="/scratch/jq14/lp5029/test/Upper_smaller_model.py"
export OPENBLAS_NUM_THREADS=1
# execution
mpiexec python3 ./$SCRIPT 1> $OUTPUTPATH/$MODELNAME.$PBS_JOBID.log 2> $OUTPUTPATH/$MODELNAME.$PBS_JOBID.err
I used qsub sample.pbs
and got these error massage from Gadi log.
Please help
Peigen
get rid of
export PYTHONPATH=/home/561/lp5029/.local/lib/python3.9/site-packages:/g/data/jq14/lp5029/codes/UWGeodynamics_2.9.6/lib/python
it's causing an issue because of the h5py install in your .local
directory.
Thanks! It is working now! The testing results come out correctly!
Hello Romain or Julian,
I am trying to using Underworld2 on Gadi. With the kind help from the NCI experts, the configuration of Underworld2 has finished under the folder of:
/g/data/jq14/lp5029/codes/underworld/
On well tested 2D model I got a error message from Gadi like this:
This is the PBS file I used for initializing Underworld:
This is nci_gadi.sh file:
Is there anything I can do to get Underworld2 up and running on Gadi?
Kind Regards Peigen