ilovesoup / hyracks

Automatically exported from code.google.com/p/hyracks
Apache License 2.0
0 stars 0 forks source link

Sporadic NC startup failures #133

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
There are sporadic cases which fail the NC due to:

********************************************
Starting NC with command /home/yingyib/pregelix4/bin/pregelixnc -cc-host 
128.195.11.31 -cc-port 1099 -cluster-net-ip-address 128.195.11.80 
-data-ip-address 128.195.11.80 -result-ip-address 128.195.11.80 -node-id 
sensorium-50 -iodevices /scratch/yingyib/t1

java.lang.NullPointerException
    at edu.uci.ics.hyracks.control.cc.work.RegisterNodeWork.doRun(RegisterNodeWork.java:74)
    at edu.uci.ics.hyracks.control.common.work.SynchronizableWork.run(SynchronizableWork.java:32)
    at edu.uci.ics.hyracks.control.common.work.WorkQueue$WorkerThread.run(WorkQueue.java:116)

However, at the CC side, if I check the adminconsole, the node is shown on the 
NC list.
Registered Node Count: 25
sensorium-50    0   0.0 Details
sensorium-49    0   0.0 Details

The registered node counting does not include the failed NCs.

Original issue reported on code.google.com by buyingyi@gmail.com on 12 Mar 2014 at 9:23