codeine-cd / codeine

Codeine is a continuous deployment tool that makes product deployment faster and safer through automation and monitoring.
Apache License 2.0
28 stars 7 forks source link

restart pool master command failed on sc_start #155

Closed duduamar closed 9 years ago

duduamar commented 9 years ago

http://codeine.intel.com:12347/codeine/view/main/project/design_pool_masters/command/restart/73/status

running command 'restart' on 1 node by damar nodes list: sc_start ===> executed on node: sc_start, output below <=== credentials = root $ /bin/sh -xe /tmp/codeine_1077739338.sh

executing: /nfs/site/gen/adm/netbatch/binaries/nbadmin pool status --ta localhost 2>&1 | /usr/bin/head -15 Starting Pool Master with /nfs/site/gen/adm/netbatch/binaries/startPoolMaster executing: /nfs/site/gen/adm/netbatch/binaries/startPoolMaster Starting Pool Master. Startup sequence is being logged to /var/netstar/log/PoolMaster.scynbm24.startup. Pool Master is started executing: /nfs/site/gen/adm/netbatch/binaries/nbadmin pool status --ta localhost 2>&1 | /usr/bin/head -15 ===> error in node sc_start message: Premature EOF <=== finished! failed nodes: sc_start =========> aggregate-command-statistics (success/total): 0/1

The pool was restarted successfully, it was just the command appeared as failed.

oshai commented 9 years ago

It looks like something (external) killed the peer. the logs are only from a later time now, so I am closing it. let me know if you see it again.