HIPS / Spearmint

Spearmint Bayesian optimization codebase
Other
1.55k stars 329 forks source link

Spearmint on the cluster #23

Open Mehrian opened 9 years ago

Mehrian commented 9 years ago

Dear all I'm trying to use Spearmint package on the cluster and I have the following problem for running the "simple" in examples! (I have to mention that, I have tested this example on my laptop successfully!)

When I first try to execute the command "mongod --fork --logpath --dbpath" in the simple example folder, I have the following error;

about to fork child process, waiting until server is ready for connections. forked process: 108120 all output going to: /home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath log file [/home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath] exists; copied to temporary file [/home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath.2015-06-02T14-32-33] ERROR: child process failed, exited with error number 100

Jasper suggested that I delete fork in the above command, and when I tried, there is no error! So, I moved to the spearmint folder and then execute the command: "python main.py ../examples/simple" but there is an error;

Traceback (most recent call last): File "main.py", line 198, in from spearmint.utils.database.mongodb import MongoDB ImportError: No module named spearmint.utils.database.mongodb

So, as Jasper suggested again!, I installed Spearmint package one more time in the spearmint folder, using the command; "pip install -e." but nothing changes!!!

I have to mention that, I do NOT see all these errors when I log into the cluster with admin user!! (With admin user, it works properly!) So, As Jasper (Thanks for his time) suggested, I installed Anaconda on the cluster...but nothing changed! So, definitely the problem is that I am NOT the admin on the cluster! I'm wondering if anybody have any suggestion?

Best wishes, Mohammad

JasperSnoek commented 9 years ago

Hi Mohammad, sorry, I'm sure that's frustrating. I think it's probably a good idea to get in touch with your cluster admin. At the moment it looks to me like they may be preventing you from spawning processes on the cluster (through the fork command). In that case, you may want to try to launch mongodb from a separate node in the cluster (you'll have to point to that machine in spearmint) or keep one terminal open in the cluster and running mongodb without fork.

Jasper

On Tue, Jun 2, 2015 at 10:43 AM, Mehrian notifications@github.com wrote:

Dear all I'm trying to use Spearmint package on the cluster and I have the following problem for running the "simple" in examples! (I have to mention that, I have tested this example on my laptop successfully!)

When I first try to execute the command "mongod --fork --logpath --dbpath" in the simple example folder, I have the following error;

############################ about to fork child process, waiting until server is ready for connections. forked process: 108120 all output going to: /home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath log file [/home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath] exists; copied to temporary file [/home/mehrianmohammad/Spearmint-master/examples/simple/--dbpath.2015-06-02T14-32-33] ERROR: child process failed, exited with error number 100 ##############################

Jasper suggested that I delete fork in the above command, and when I tried, there is no error! So, I moved to the spearmint folder and then execute the command: "python main.py ../examples/simple" but there is an error;

###################### Traceback (most recent call last): File "main.py", line 198, in from spearmint.utils.database.mongodb import MongoDB ImportError: No module named spearmint.utils.database.mongodb #######################

So, as Jasper suggested again!, I installed Spearmint package one more time in the spearmint folder, using the command; "pip install -e." but nothing changes!!!

I have to mention that, I do NOT see all these errors when I log into the cluster with admin user!! (With admin user, it works properly!) So, As Jasper (Thanks for his time) suggested, I installed Anaconda on the cluster...but nothing changed! So, definitely the problem is that I am NOT the admin on the cluster! I'm wondering if anybody have any suggestion?

Best wishes, Mohammad

— Reply to this email directly or view it on GitHub https://github.com/HIPS/Spearmint/issues/23.

Mehrian commented 9 years ago

Hi Jasper Thanks for your answers! Actually, the admin is my friend and we go through all these things with together! but he doesn't have any idea! (He is not an expert with these stuff!) Also, the cluster that we use and we run all our simulations is not that big and there are just two or three users! (Let’s say it is a big computer;) ) We don't have something called "fork" on the cluster and as the admin said, nothing is preventing me from anything! The only thing is that I'm not the root... Best Regards, Mohammad