automl / HpBandSter

a distributed Hyperband implementation on Steroids
BSD 3-Clause "New" or "Revised" License
610 stars, 109 forks

The bash file from the Sphinx documentation of hpbandster used in example 4 does not work properly. #24

Open gmatilde opened 6 years ago

gmatilde commented 6 years ago

# submit via qsub -t 1-4 -q test_core.q example_4_cluster_submit_me.sh

#$ -cwd
#$ -o $JOB_ID-$TASK_ID.o
#$ -e $JOB_ID-$TASK_ID.e

# enter the virtual environment
source ~sfalkner/virtualenvs/HpBandSter_tests/bin/activate

if [ $SGE_TASK_ID -eq 1]
   then python3 example_4_cluster.py --run_id $JOB_ID --nic_name eth0 --working_dir .
else
   python3 example_4_cluster.py --run_id $JOB_ID --nic_name eth0  --working_dir . --worker
fi

sfalkner commented 6 years ago

Hey Matilde, I assume that the jobs fail because they cannot source the virtual environment, since its path is hard-coded to my setup. The script also has the nic_name for our cluster hard-coded, so it isn't as general as it could be. Do you think it would be enough to emphasize in the documentation that this script is only meant as a reference and won't work for anybody out of the box?

gmatilde commented 6 years ago

Hey Stefan,

Of course I was not using your virtual environment but the one for my own setup, so that cannot be the problem. The jobs were failing because no worker could be found, not because the virtual environment could not be sourced. I think it is quite clear that the script in the documentation is just a reference and has to be adapted to each setup, but adding a warning does no harm.
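
One detail in the script as quoted above is worth checking independently of the environment: the test `[ $SGE_TASK_ID -eq 1]` has no space before the closing bracket. Bash then reports a missing `]`, the test fails, and every task takes the `else` branch, so all of them start with `--worker`. Whether this is what causes the failure reported here is not confirmed in this thread. The following is only a minimal sketch of an adapted submit script, not the version shipped with the documentation. `VENV_DIR` and `NIC_NAME` are hypothetical variables introduced for illustration, and `role_flag` and `launch` are hypothetical helper names.

```shell
#!/bin/bash
# submit via: qsub -t 1-4 -q test_core.q example_4_cluster_submit_me.sh

#$ -cwd
#$ -o $JOB_ID-$TASK_ID.o
#$ -e $JOB_ID-$TASK_ID.e

# Hypothetical, site-specific settings; adjust both to your own cluster.
VENV_DIR="${VENV_DIR:-$HOME/virtualenvs/hpbandster}"
NIC_NAME="${NIC_NAME:-eth0}"

# Print the extra flag for a given array task id:
# task 1 runs without --worker, every other task runs with it.
role_flag() {
    if [ "$1" -eq 1 ]; then   # note the space before the closing bracket
        echo ""
    else
        echo "--worker"
    fi
}

launch() {
    # enter the virtual environment
    source "$VENV_DIR/bin/activate"
    # $(role_flag ...) is left unquoted on purpose, so an empty result
    # adds no argument to the command line
    python3 example_4_cluster.py --run_id "$JOB_ID" --nic_name "$NIC_NAME" \
        --working_dir . $(role_flag "$SGE_TASK_ID")
}

# Only launch when SGE_TASK_ID is set, i.e. when run as part of an array job.
if [ -n "${SGE_TASK_ID:-}" ]; then
    launch
fi
```

Keeping the role decision in a small function makes it possible to check the branch logic locally, for example with `role_flag 1` and `role_flag 2`, before submitting anything to the queue.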