Closed galindro closed 7 years ago
Hi @galindro
If your goal is to have redundancy across AZs, you can achieve that by providing a single volume-id to PX instance and spinning up PX in different AZs. When you create volumes in PX with replication factor greater than 1, data will be autmatically be replicated across AZs. If you do want 3 EBS volumes per instance, you can do that as well, just wanted to clarify that you don't have to.
In AWS EBS volumes can only be attached on the instances in the same zones, there is definitely a bug in our handling of that. We will address that.
In the meantime, as a workaround, you can specify the EBS volume ID that matches the zone where Portworx instance is spun up, e.g bring up PX instances on zone sa-east-1c with vol-002334... as template, PX instances on zone sa-east-1b with vol-058d..as template etc.
Thanks for reporting this!
Hi @jvinod . The problem still persists if I configure portworx with the correct volume. The issue is ocurring with portworx enterprise too. Here are some details of my environment:
I've 3 docker swarm manager nodes running on each AZ from sa-east-1 region. Each of them have portworx installed (as a docker plugin) and working very well.
Each AZ has an EBS volume with 30GB size that is used as template for portworx. As you coul see bellow, they are attached to each 3 manager nodes:
pxctl status
, executed from a manager node:Status: PX is operational
Node ID: 25ca5091-4a29-4f1b-ad21-b94338be313b
IP: 10.0.2.102
Local Storage Pool: 1 pool
POOL IO_PRIORITY RAID_LEVEL USABLE USED STATUS ZONE REGION
0 LOW raid0 30 GiB 14 GiB Online b sa-east-1
Local Storage Devices: 1 device
Device Path Media Type Size Last-Scan
0:1 /dev/xvdg STORAGE_MEDIUM_MAGNETIC 30 GiB 17 Jul 17 16:18 UTC
total - 30 GiB
Cluster Summary
Cluster ID: socialbase
IP ID Used Capacity Status
10.0.2.102 25ca5091-4a29-4f1b-ad21-b94338be313b 14 GiB 30 GiB Online (This node)
10.0.1.101 7a943a7e-b9b9-43a2-b6c0-8014934672fa 14 GiB 30 GiB Online
10.0.3.103 c130fbfd-4549-4a46-a4ef-771f82d4395c 14 GiB 30 GiB Online
Global Storage Pool
Total Used : 42 GiB
Total Capacity : 90 GiB
After built the above scenario, I decided to add docker swarm worker nodes in the cluster. So, I've tried to build an ami with portworx running as a container (instead of docker plugin), but the error showed here ocurred... After reporting this, I've tried to make the same ami with portworx docker plugin and with px-enterprise docker container. None of them worked with the same error.
My last attempt was to use px-enterprise as a docker container setting up the parameters like you pointed. But, not worked too. Here are the container logs and info, instance and ami details:
root@ip-10-0-1-9:~# docker inspect portworx
[
{
"Id": "efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879",
"Created": "2017-07-18T10:50:46.063458381Z",
"Path": "/docker-entry-point.sh",
"Args": [
"-c",
"socialbase",
"-k",
"consul:http://swarmmanager.sb:8500",
"-s",
"vol-0842059ddbdbd9d7f"
],
"State": {
"Status": "exited",
"Running": false,
"Paused": false,
"Restarting": false,
"OOMKilled": false,
"Dead": false,
"Pid": 0,
"ExitCode": 0,
"Error": "",
"StartedAt": "2017-07-18T10:50:48.254479076Z",
"FinishedAt": "2017-07-18T10:58:10.923496743Z"
},
"Image": "sha256:ca9551f7c0c39dadfc66e02ec46e542d442975f3c69b937ed3b27ac05861ea07",
"ResolvConfPath": "/var/lib/docker/containers/efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879/resolv.conf",
"HostnamePath": "/var/lib/docker/containers/efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879/hostname",
"HostsPath": "/var/lib/docker/containers/efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879/hosts",
"LogPath": "/var/lib/docker/containers/efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879/efc155a39a367e5f70d854cc0b33673c33c3b5b7c824220fb9b7ed2d44af4879-json.log",
"Name": "/portworx",
"RestartCount": 0,
"Driver": "overlay2",
"MountLabel": "",
"ProcessLabel": "",
"AppArmorProfile": "unconfined",
"ExecIDs": null,
"HostConfig": {
"Binds": [
"/dev:/dev",
"/etc/pwx:/etc/pwx",
"/opt/pwx/bin:/export_bin:shared",
"/var/run/docker.sock:/var/run/docker.sock",
"/var/cores:/var/cores",
"/usr/src:/usr/src",
"/run/docker/plugins:/run/docker/plugins",
"/var/lib/osd:/var/lib/osd:shared"
],
"ContainerIDFile": "",
"LogConfig": {
"Type": "json-file",
"Config": {}
},
"NetworkMode": "host",
"PortBindings": {},
"RestartPolicy": {
"Name": "unless-stopped",
"MaximumRetryCount": 0
},
"AutoRemove": false,
"VolumeDriver": "",
"VolumesFrom": null,
"CapAdd": null,
"CapDrop": null,
"Dns": [],
"DnsOptions": [],
"DnsSearch": [],
"ExtraHosts": null,
"GroupAdd": null,
"IpcMode": "",
"Cgroup": "",
"Links": null,
"OomScoreAdj": 0,
"PidMode": "",
"Privileged": true,
"PublishAllPorts": false,
"ReadonlyRootfs": false,
"SecurityOpt": [
"label=disable"
],
"UTSMode": "",
"UsernsMode": "",
"ShmSize": 67108864,
"Runtime": "runc",
"ConsoleSize": [
0,
0
],
"Isolation": "",
"CpuShares": 0,
"Memory": 0,
"NanoCpus": 0,
"CgroupParent": "",
"BlkioWeight": 0,
"BlkioWeightDevice": null,
"BlkioDeviceReadBps": null,
"BlkioDeviceWriteBps": null,
"BlkioDeviceReadIOps": null,
"BlkioDeviceWriteIOps": null,
"CpuPeriod": 0,
"CpuQuota": 0,
"CpuRealtimePeriod": 0,
"CpuRealtimeRuntime": 0,
"CpusetCpus": "",
"CpusetMems": "",
"Devices": [],
"DeviceCgroupRules": null,
"DiskQuota": 0,
"KernelMemory": 0,
"MemoryReservation": 0,
"MemorySwap": 0,
"MemorySwappiness": -1,
"OomKillDisable": false,
"PidsLimit": 0,
"Ulimits": null,
"CpuCount": 0,
"CpuPercent": 0,
"IOMaximumIOps": 0,
"IOMaximumBandwidth": 0
},
"GraphDriver": {
"Data": {
"LowerDir": "/var/lib/docker/overlay2/652c9affa013137d8a6a1337621f1598295591a912480ff581120200110b4440-init/diff:/var/lib/docker/overlay2/816f11c26bf35deb44fa8ba5ef28da9b2003cf6f0899616952b5fadd907e3cb7/diff:/var/lib/docker/overlay2/2df992b856da08483147a98ea2a1d84ccad966af43eb7eb5d268748485116ff7/diff:/var/lib/docker/overlay2/4b19d59eac40c26af70e9b83776b3378a1fbff15f3e24652b5f9dc8277b97101/diff:/var/lib/docker/overlay2/2a79aa3e856c37f90c72bc9f1391200050c476ecd45cea6cca075759cf9821f4/diff:/var/lib/docker/overlay2/a9b129c87d2b0beb769123a69cfc59ec108d5396b59a9f42ad89a79435aea17e/diff:/var/lib/docker/overlay2/10be848ebba274f65ae8a26ef236b6fe25743a4c56d56c64c25b494ab3b7cc60/diff:/var/lib/docker/overlay2/61c514a57b9c9ec1abf4aecf08837a59515a65d5f0cffc93a73754485c2bce99/diff",
"MergedDir": "/var/lib/docker/overlay2/652c9affa013137d8a6a1337621f1598295591a912480ff581120200110b4440/merged",
"UpperDir": "/var/lib/docker/overlay2/652c9affa013137d8a6a1337621f1598295591a912480ff581120200110b4440/diff",
"WorkDir": "/var/lib/docker/overlay2/652c9affa013137d8a6a1337621f1598295591a912480ff581120200110b4440/work"
},
"Name": "overlay2"
},
"Mounts": [
{
"Type": "bind",
"Source": "/opt/pwx/bin",
"Destination": "/export_bin",
"Mode": "shared",
"RW": true,
"Propagation": "shared"
},
{
"Type": "bind",
"Source": "/var/run/docker.sock",
"Destination": "/var/run/docker.sock",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Type": "bind",
"Source": "/var/cores",
"Destination": "/var/cores",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Type": "bind",
"Source": "/usr/src",
"Destination": "/usr/src",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Type": "bind",
"Source": "/run/docker/plugins",
"Destination": "/run/docker/plugins",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Type": "bind",
"Source": "/var/lib/osd",
"Destination": "/var/lib/osd",
"Mode": "shared",
"RW": true,
"Propagation": "shared"
},
{
"Type": "bind",
"Source": "/dev",
"Destination": "/dev",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Type": "bind",
"Source": "/etc/pwx",
"Destination": "/etc/pwx",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
}
],
"Config": {
"Hostname": "ip-10-0-1-9",
"Domainname": "",
"User": "",
"AttachStdin": false,
"AttachStdout": false,
"AttachStderr": false,
"Tty": false,
"OpenStdin": false,
"StdinOnce": false,
"Env": [
"PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
"DEBIAN_FRONTEND=noninteractive",
"container=docker",
"LD_LIBRARY_PATH=/usr/local/lib64:/usr/lib64/:/usr/lib",
"GOTRACEBACK=crash",
"GOMAXPROCS=64",
"PXMOD_SOURCE=/home/px-fuse",
"PXMOD_VERSION=2"
],
"Cmd": [
"-c",
"socialbase",
"-k",
"consul:http://swarmmanager.sb:8500",
"-s",
"vol-0842059ddbdbd9d7f"
],
"Image": "portworx/px-enterprise",
"Volumes": null,
"WorkingDir": "/",
"Entrypoint": [
"/docker-entry-point.sh"
],
"OnBuild": null,
"Labels": {}
},
"NetworkSettings": {
"Bridge": "",
"SandboxID": "bff969e4864b30ccfed2fbd482c2bff74a9c54019e9a9f03adb87055352cab6c",
"HairpinMode": false,
"LinkLocalIPv6Address": "",
"LinkLocalIPv6PrefixLen": 0,
"Ports": {},
"SandboxKey": "/var/run/docker/netns/default",
"SecondaryIPAddresses": null,
"SecondaryIPv6Addresses": null,
"EndpointID": "",
"Gateway": "",
"GlobalIPv6Address": "",
"GlobalIPv6PrefixLen": 0,
"IPAddress": "",
"IPPrefixLen": 0,
"IPv6Gateway": "",
"MacAddress": "",
"Networks": {
"host": {
"IPAMConfig": null,
"Links": null,
"Aliases": null,
"NetworkID": "398c38548ba4d94e5cc226c1340bce8c82400ec9ff2ec562e76b4d5cee61697a",
"EndpointID": "",
"Gateway": "",
"IPAddress": "",
"IPPrefixLen": 0,
"IPv6Gateway": "",
"GlobalIPv6Address": "",
"GlobalIPv6PrefixLen": 0,
"MacAddress": "",
"DriverOpts": null
}
}
}
}
]
root@ip-10-0-1-9:~# docker logs portworx
Executing with arguments: -c socialbase -k consul:http://swarmmanager.sb:8500 -s vol-0842059ddbdbd9d7f
Tue Jul 18 10:51:03 UTC 2017 : Running on Linux ip-10-0-1-9 4.4.0-83-generic #106-Ubuntu SMP Mon Jun 26 17:54:43 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
size for /dev/shm is 67108864, less than required 293601280
checking /hostusr/src/kernels/4.4.0-83-generic
checking /hostusr/src/linux-headers-4.4.0-83-generic
checking /usr/src/kernels/4.4.0-83-generic
checking /usr/src/linux-headers-4.4.0-83-generic
found /usr/src/linux-headers-4.4.0-83-generic
checking /usr/src/linux-headers-4.4.0-83-generic
found /usr/src/linux-headers-4.4.0-83-generic
Flushing buffer-cache..
checking /usr/src/linux-headers-4.4.0-83-generic
found /usr/src/linux-headers-4.4.0-83-generic
Creating px fs...
Using cluster: socialbase
Key Value Store: consul:http://swarmmanager.sb:8500
Using storage device: vol-0842059ddbdbd9d7f
Fingerprints values:
HOSTNAME={"type": 1, "value": "ip-10-0-1-9"}
DISK={ "type": 6, "value": "vol-0842059ddbdbd9d7f" }
NETWORK={ "type": 7, "value": "" }
NETWORK={ "type": 8, "value": "" }
FP_FINGERPRINT="Fingerprint": [{"type": 1, "value": "ip-10-0-1-9"}, { "type": 2, "value": "Unknown_Vendor" }, { "type": 3, "value": "Unknown_Model" }, { "type": 4, "value": "Intel(R) Xeon(R) CPU E5-2666 v3 @ 2.90GHz" }, { "type": 5, "value": "3852912 kB" }, { "type": 6, "value": "vol-0842059ddbdbd9d7f" }, { "type": 7, "value": "" }, { "type": 8, "value": "" }]
FP_CONFIG={ "token": "", "create_profile": true, "profile_name": "ip-10-0-1-9-ChYeAcsm", "Fingerprint": [{"type": 1, "value": "ip-10-0-1-9"}, { "type": 2, "value": "Unknown_Vendor" }, { "type": 3, "value": "Unknown_Model" }, { "type": 4, "value": "Intel(R) Xeon(R) CPU E5-2666 v3 @ 2.90GHz" }, { "type": 5, "value": "3852912 kB" }, { "type": 6, "value": "vol-0842059ddbdbd9d7f" }, { "type": 7, "value": "" }, { "type": 8, "value": "" }] }
To use the same hardware configuration on an identical node, please use hardware profile ip-10-0-1-9-ChYeAcsm.
example:
curl -fsSL http://lighthouse.portworx.com/px_bootstrap.sh > px_bootstrap.sh; chmod u+x px_bootstrap.sh; PWX_TOKEN= PWX_CLUSTERNAME=socialbase bootstrap.sh -p ip-10-0-1-9-ChYeAcsm
Failed to enable rt scheduler
Checking sysfs mount...
/docker-entry-point.sh: line 763: /sys/fs/cgroup/cpu/cpu.rt_runtime_us: Permission denied
2017-07-18 10:53:07,645 CRIT Supervisor running as root (no user in config file)
2017-07-18 10:53:07,648 INFO supervisord started with pid 1
2017-07-18 10:53:08,649 INFO spawned: 'relayd' with pid 1017
2017-07-18 10:53:08,651 INFO spawned: 'lttng' with pid 1018
2017-07-18 10:53:08,653 INFO spawned: 'exec' with pid 1019
2017-07-18 10:53:08,654 INFO spawned: 'pxdaemon' with pid 1020
2017-07-18 10:53:08,655 INFO spawned: 'px-ns' with pid 1021
2017-07-18 10:53:08,656 INFO spawned: 'px_event_listener' with pid 1022
PXPROCS: lttng not started yet...sleeping
Tue Jul 18 10:53:09 UTC 2017 cleaning up old traces
Tue Jul 18 10:53:09 UTC 2017 done cleaning up old traces
2017-07-18 10:53:10,430 INFO success: relayd entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:53:10,430 INFO success: lttng entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:53:10,431 INFO success: exec entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:53:10,431 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:53:10,431 INFO success: px-ns entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:53:10,431 INFO success: px_event_listener entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: lttng not started yet...sleeping
time="2017-07-18T10:53:12Z" level=info msg="px-ns Starting.."
NS client starting fuse module
Starting NS server
PXPROCS: lttng not started yet...sleeping
PXPROCS: lttng not started yet...sleeping
Spawning a session daemon
Session pxd created.
Traces will be written in net://localhost
Live timer set to 2000000 usec
Session pxd set to shm_path: /var/lib/osd/lttng/pxd-20170718-105320.
UST channel pxd_channel enabled for session pxd
All UST events are enabled in channel pxd_channel
Tracing started for session pxd
PXPROCS: Started px-storage with pid 1095
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
PXPROCS: Started px with pid 1110
PXPROCS: Started watchdog with pid 1111
2017-07-18_10:53:31: PX-Watchdog: Starting watcher
2017-07-18_10:53:31: PX-Watchdog: Waiting for px process to start
root 1110 1020 0 10:53 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:53:31: PX-Watchdog: (pid 1110): Begin monitoring
time="2017-07-18T10:53:54Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:53:54Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:53:54Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 5ea17f26-cbaf-4088-bc9d-674b36594b52). Trying attachSet..."
time="2017-07-18T10:53:55Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:53:55Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:53:55Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:53:55Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:53:55Z" level=info msg="New NodeID is f33f56bc-8c56-4aed-a6e3-582f3a1ff74e"
time="2017-07-18T10:53:55Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:53:55Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:53:55Z" level=info msg="Node f33f56bc-8c56-4aed-a6e3-582f3a1ff74e with Index (246) is Up"
time="2017-07-18T10:54:05Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-01e92c660a0907784' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 2205893c-860f-4957-87d6-1a1464fda794). Trying attachSet..."
time="2017-07-18T10:54:06Z" level=info msg="Failed to attach volume set vol-01e92c660a0907784 vol-0722a31be004a9641 vol-0748ad5024791b39c with error: No sets available. Retrying..."
time="2017-07-18T10:54:09Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-01e92c660a0907784' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 6455ed30-f7f1-4cb0-bb7e-15ea1bf63daf). Trying attachSet..."
time="2017-07-18T10:54:10Z" level=info msg="Failed to attach volume set vol-01e92c660a0907784 vol-0722a31be004a9641 vol-0748ad5024791b39c with error: No sets available. Retrying..."
time="2017-07-18T10:54:13Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-01e92c660a0907784' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: a9633b17-f402-4c98-8538-0dcaddd03f1c). Trying attachSet..."
time="2017-07-18T10:54:13Z" level=info msg="Failed to attach volume set vol-01e92c660a0907784 vol-0722a31be004a9641 vol-0748ad5024791b39c with error: No sets available. Retrying..."
time="2017-07-18T10:54:16Z" level=info msg="Deleted volume vol-01e92c660a0907784"
time="2017-07-18T10:54:16Z" level=info msg="Deleted volume vol-0722a31be004a9641"
time="2017-07-18T10:54:16Z" level=info msg="Deleted volume vol-0748ad5024791b39c"
time="2017-07-18T10:54:16Z" level=warning msg="Unable to save new node configuration: No sets available"
PXPROCS: px daemon exited with code: 6
1095
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:54:18,016 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:54:19,018 INFO spawned: 'pxdaemon' with pid 1163
PXPROCS: Started px-storage with pid 1167
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:54:20,431 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1176
PXPROCS: Started watchdog with pid 1177
2017-07-18_10:54:22: PX-Watchdog: Starting watcher
2017-07-18_10:54:22: PX-Watchdog: Waiting for px process to start
root 1176 1163 0 10:54 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:54:22: PX-Watchdog: (pid 1176): Begin monitoring
time="2017-07-18T10:54:22Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:54:22Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:54:22Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: a0413bca-fec1-40d4-8adf-cdcb2f1bf601). Trying attachSet..."
time="2017-07-18T10:54:22Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:54:22Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:54:22Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:54:22Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:54:22Z" level=info msg="New NodeID is 5c5ae62f-e7c7-4957-8506-c671a5b9d737"
time="2017-07-18T10:54:22Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:54:22Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:54:22Z" level=info msg="Node 5c5ae62f-e7c7-4957-8506-c671a5b9d737 with Index (247) is Up"
2017-07-18 10:54:34,583 INFO reaped unknown pid 1147
time="2017-07-18T10:54:36Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-0045ab04b85a40940' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 1c1be04c-4e63-4b0b-9474-c5f001ba8016). Trying attachSet..."
time="2017-07-18T10:54:36Z" level=info msg="Failed to attach volume set vol-0a1e5ee018c71527a vol-0045ab04b85a40940 vol-0db8469b82c820b0e with error: No sets available. Retrying..."
time="2017-07-18T10:54:40Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-0045ab04b85a40940' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: e83f499b-8e63-4278-9423-444a31f9b5de). Trying attachSet..."
time="2017-07-18T10:54:40Z" level=info msg="Failed to attach volume set vol-0a1e5ee018c71527a vol-0045ab04b85a40940 vol-0db8469b82c820b0e with error: No sets available. Retrying..."
time="2017-07-18T10:54:43Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-0045ab04b85a40940' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 8a785d50-0149-43c3-a61a-e1407e26751b). Trying attachSet..."
time="2017-07-18T10:54:43Z" level=info msg="Failed to attach volume set vol-0a1e5ee018c71527a vol-0045ab04b85a40940 vol-0db8469b82c820b0e with error: No sets available. Retrying..."
time="2017-07-18T10:54:46Z" level=info msg="Deleted volume vol-0a1e5ee018c71527a"
time="2017-07-18T10:54:46Z" level=info msg="Deleted volume vol-0045ab04b85a40940"
time="2017-07-18T10:54:46Z" level=info msg="Deleted volume vol-0db8469b82c820b0e"
time="2017-07-18T10:54:46Z" level=warning msg="Unable to save new node configuration: No sets available"
PXPROCS: px daemon exited with code: 6
1167
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:54:47,744 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:54:48,746 INFO spawned: 'pxdaemon' with pid 1270
PXPROCS: Started px-storage with pid 1271
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:54:50,333 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1280
PXPROCS: Started watchdog with pid 1281
2017-07-18_10:54:51: PX-Watchdog: Starting watcher
2017-07-18_10:54:51: PX-Watchdog: Waiting for px process to start
root 1280 1270 0 10:54 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:54:51: PX-Watchdog: (pid 1280): Begin monitoring
time="2017-07-18T10:54:52Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:54:52Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:54:53Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 7a7c0606-7853-4c0e-8c9b-32d92b720ffc). Trying attachSet..."
time="2017-07-18T10:54:53Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:54:53Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:54:53Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:54:53Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:54:53Z" level=info msg="New NodeID is 8e427559-aa6a-4c4d-9aee-45a3d85a0489"
time="2017-07-18T10:54:53Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:54:53Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:54:53Z" level=info msg="Node 8e427559-aa6a-4c4d-9aee-45a3d85a0489 with Index (248) is Up"
Waiting for state transition to "attached".
time="2017-07-18T10:55:09Z" level=warning msg="Rollback attach volume /dev/xvdf, Error InvalidVolume.ZoneMismatch: The volume 'vol-072595a4306d3de9e' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: bf3e13d6-3fda-48e1-9c22-80af40833c14"
time="2017-07-18T10:55:09Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-072595a4306d3de9e' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: bf3e13d6-3fda-48e1-9c22-80af40833c14). Trying attachSet..."
time="2017-07-18T10:55:10Z" level=info msg="Failed to attach volume set vol-072595a4306d3de9e vol-0862f831687cbcde0 vol-0076642dbddf81835 with error: No sets available. Retrying..."
time="2017-07-18T10:55:13Z" level=warning msg="AttachSet: attachVols failed with error (VolumeInUse: vol-0076642dbddf81835 is already attached to an instance\n\tstatus code: 400, request id: 9df38e28-0a35-44a6-b99e-e3894fed6044). Trying attachSet..."
time="2017-07-18T10:55:13Z" level=info msg="Failed to attach volume set vol-072595a4306d3de9e vol-0862f831687cbcde0 vol-0076642dbddf81835 with error: No sets available. Retrying..."
time="2017-07-18T10:55:17Z" level=warning msg="AttachSet: attachVols failed with error (VolumeInUse: vol-0076642dbddf81835 is already attached to an instance\n\tstatus code: 400, request id: 27b24a7b-a945-4904-b953-7722de251354). Trying attachSet..."
time="2017-07-18T10:55:17Z" level=info msg="Failed to attach volume set vol-072595a4306d3de9e vol-0862f831687cbcde0 vol-0076642dbddf81835 with error: No sets available. Retrying..."
time="2017-07-18T10:55:20Z" level=info msg="Deleted volume vol-072595a4306d3de9e"
time="2017-07-18T10:55:20Z" level=info msg="Deleted volume vol-0862f831687cbcde0"
time="2017-07-18T10:55:20Z" level=warning msg="Delete failed for volume vol-0076642dbddf81835, Error VolumeInUse: Volume vol-0076642dbddf81835 is currently attached to i-07b3f498e4a2a2441\n\tstatus code: 400, request id: 384b1d45-f76e-4a3b-89a7-fb252b042837"
time="2017-07-18T10:55:20Z" level=info msg="Deleted volume vol-0076642dbddf81835"
time="2017-07-18T10:55:20Z" level=warning msg="Unable to save new node configuration: No sets available"
PXPROCS: px daemon exited with code: 6
1271
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:55:21,528 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:55:22,530 INFO spawned: 'pxdaemon' with pid 1334
PXPROCS: Started px-storage with pid 1335
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:55:24,113 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
2017-07-18 10:55:25,080 INFO reaped unknown pid 1221
PXPROCS: Started px with pid 1344
PXPROCS: Started watchdog with pid 1345
2017-07-18_10:55:25: PX-Watchdog: Starting watcher
2017-07-18_10:55:25: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1167-time1500375286
2017-07-18_10:55:25: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:54:46.872231153 +0000 UTC.stack
2017-07-18_10:55:25: PX-Watchdog: Waiting for px process to start
root 1344 1334 0 10:55 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:55:25: PX-Watchdog: (pid 1344): Begin monitoring
time="2017-07-18T10:55:25Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:55:25Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:55:26Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 3336988e-151c-4e32-8c83-9a58add6bcc7). Trying attachSet..."
time="2017-07-18T10:55:26Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:55:26Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:55:26Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:55:26Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:55:26Z" level=info msg="New NodeID is d7b5b974-00aa-42a9-aee5-f87a05be7cc0"
time="2017-07-18T10:55:26Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:55:26Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:55:26Z" level=info msg="Node d7b5b974-00aa-42a9-aee5-f87a05be7cc0 with Index (249) is Up"
Waiting for state transition to "attached".
time="2017-07-18T10:55:49Z" level=warning msg="Rollback attach volume /dev/xvdg, Error InvalidVolume.ZoneMismatch: The volume 'vol-07eb92cc0a06b8feb' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 69ba12b8-59ee-41e1-9cb3-9a2b4332a088"
time="2017-07-18T10:55:49Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-07eb92cc0a06b8feb' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 69ba12b8-59ee-41e1-9cb3-9a2b4332a088). Trying attachSet..."
time="2017-07-18T10:55:49Z" level=info msg="Failed to attach volume set vol-07eb92cc0a06b8feb vol-09c30c6651676c12c vol-0462d35b85744c45d with error: No sets available. Retrying..."
time="2017-07-18T10:55:53Z" level=warning msg="AttachSet: attachVols failed with error (VolumeInUse: vol-0462d35b85744c45d is already attached to an instance\n\tstatus code: 400, request id: cc0ef9ac-3132-447b-8937-d38dc0f78b01). Trying attachSet..."
time="2017-07-18T10:55:53Z" level=info msg="Failed to attach volume set vol-07eb92cc0a06b8feb vol-09c30c6651676c12c vol-0462d35b85744c45d with error: No sets available. Retrying..."
2017-07-18 10:55:54,835 INFO reaped unknown pid 1322
time="2017-07-18T10:55:56Z" level=warning msg="AttachSet: attachVols failed with error (VolumeInUse: vol-0462d35b85744c45d is already attached to an instance\n\tstatus code: 400, request id: 034ac855-3eee-4ded-90eb-0a1f4168b9af). Trying attachSet..."
time="2017-07-18T10:55:57Z" level=info msg="Failed to attach volume set vol-07eb92cc0a06b8feb vol-09c30c6651676c12c vol-0462d35b85744c45d with error: No sets available. Retrying..."
time="2017-07-18T10:56:00Z" level=info msg="Deleted volume vol-07eb92cc0a06b8feb"
time="2017-07-18T10:56:00Z" level=info msg="Deleted volume vol-09c30c6651676c12c"
time="2017-07-18T10:56:00Z" level=warning msg="Delete failed for volume vol-0462d35b85744c45d, Error VolumeInUse: Volume vol-0462d35b85744c45d is currently attached to i-07b3f498e4a2a2441\n\tstatus code: 400, request id: 288c9da7-c3d0-4579-91ff-ea9383b611e1"
time="2017-07-18T10:56:00Z" level=info msg="Deleted volume vol-0462d35b85744c45d"
time="2017-07-18T10:56:00Z" level=warning msg="Unable to save new node configuration: No sets available"
PXPROCS: px daemon exited with code: 6
1335
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:56:01,184 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:56:02,186 INFO spawned: 'pxdaemon' with pid 1400
PXPROCS: Started px-storage with pid 1401
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:56:03,788 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1410
PXPROCS: Started watchdog with pid 1411
2017-07-18_10:56:05: PX-Watchdog: Starting watcher
2017-07-18_10:56:05: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1271-time1500375320
2017-07-18_10:56:05: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:55:20.638325924 +0000 UTC.stack
2017-07-18_10:56:05: PX-Watchdog: Waiting for px process to start
root 1410 1400 0 10:56 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:56:05: PX-Watchdog: (pid 1410): Begin monitoring
time="2017-07-18T10:56:05Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:56:05Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:56:05Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: d97f7fcc-841d-4991-9b36-6729171b4899). Trying attachSet..."
time="2017-07-18T10:56:06Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:56:06Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:56:06Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:56:06Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:56:06Z" level=info msg="New NodeID is a5a3e22d-6657-4bd2-a384-15bc6dc970e8"
time="2017-07-18T10:56:06Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:56:06Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:56:06Z" level=info msg="Node a5a3e22d-6657-4bd2-a384-15bc6dc970e8 with Index (250) is Up"
time="2017-07-18T10:56:20Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-07c8b6cdb5c947ecd' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 0a96f615-0a89-43af-9ab1-0ce0b5ca8776). Trying attachSet..."
time="2017-07-18T10:56:20Z" level=info msg="Failed to attach volume set vol-0971737c1e9ebbadf vol-07c8b6cdb5c947ecd vol-0cb491c9f8c50d6e6 with error: No sets available. Retrying..."
time="2017-07-18T10:56:23Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-07c8b6cdb5c947ecd' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: cb4c7fb5-df33-4eb8-bc17-6321b0e49d24). Trying attachSet..."
time="2017-07-18T10:56:23Z" level=info msg="Failed to attach volume set vol-0971737c1e9ebbadf vol-07c8b6cdb5c947ecd vol-0cb491c9f8c50d6e6 with error: No sets available. Retrying..."
time="2017-07-18T10:56:26Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-07c8b6cdb5c947ecd' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: c9597efe-0c23-48d7-ad82-6a8fa759f73c). Trying attachSet..."
time="2017-07-18T10:56:26Z" level=info msg="Failed to attach volume set vol-0971737c1e9ebbadf vol-07c8b6cdb5c947ecd vol-0cb491c9f8c50d6e6 with error: No sets available. Retrying..."
2017-07-18 10:56:28,673 INFO reaped unknown pid 1392
time="2017-07-18T10:56:31Z" level=info msg="Deleted volume vol-0971737c1e9ebbadf"
time="2017-07-18T10:56:31Z" level=info msg="Deleted volume vol-07c8b6cdb5c947ecd"
time="2017-07-18T10:56:31Z" level=info msg="Deleted volume vol-0cb491c9f8c50d6e6"
time="2017-07-18T10:56:31Z" level=warning msg="Unable to save new node configuration: No sets available"
PXPROCS: px daemon exited with code: 6
1401
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:56:32,332 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:56:33,334 INFO spawned: 'pxdaemon' with pid 1465
PXPROCS: Started px-storage with pid 1466
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:56:34,928 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1475
PXPROCS: Started watchdog with pid 1476
2017-07-18_10:56:36: PX-Watchdog: Starting watcher
2017-07-18_10:56:36: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1335-time1500375360
2017-07-18_10:56:36: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:56:00.475475355 +0000 UTC.stack
2017-07-18_10:56:36: PX-Watchdog: Waiting for px process to start
time="2017-07-18T10:56:36Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:56:36Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
root 1475 1465 0 10:56 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:56:36: PX-Watchdog: (pid 1475): Begin monitoring
time="2017-07-18T10:56:37Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 8a443c49-dcb7-45e0-8bbf-f73f0aa758a5). Trying attachSet..."
time="2017-07-18T10:56:37Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:56:37Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:56:37Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:56:37Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:56:37Z" level=info msg="New NodeID is 01605562-5a58-40f6-97db-a3c0e5e21cd4"
time="2017-07-18T10:56:37Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:56:37Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:56:37Z" level=info msg="Node 01605562-5a58-40f6-97db-a3c0e5e21cd4 with Index (251) is Up"
time="2017-07-18T10:56:42Z" level=warning msg="Unable to save new node configuration: RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: e45a1262-0b6a-4088-8e4c-3af58f566909"
PXPROCS: px daemon exited with code: 6
1466
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:56:43,031 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:56:44,034 INFO spawned: 'pxdaemon' with pid 1531
PXPROCS: Started px-storage with pid 1532
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:56:45,609 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1541
PXPROCS: Started watchdog with pid 1542
2017-07-18_10:56:47: PX-Watchdog: Starting watcher
2017-07-18_10:56:47: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1401-time1500375391
2017-07-18_10:56:47: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:56:31.564340556 +0000 UTC.stack
2017-07-18_10:56:47: PX-Watchdog: Waiting for px process to start
root 1541 1531 0 10:56 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:56:47: PX-Watchdog: (pid 1541): Begin monitoring
time="2017-07-18T10:56:47Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:56:47Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
time="2017-07-18T10:56:53Z" level=warning msg="AttachSet: attachVols failed with error (RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: b553ba36-2462-4ca2-900f-fc9ca146ccb1). Trying attachSet..."
time="2017-07-18T10:56:53Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:56:53Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:56:53Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:56:53Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:56:53Z" level=info msg="New NodeID is ec9aaaef-96bf-4f35-bd69-ef951b0c3c23"
time="2017-07-18T10:56:53Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:56:53Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:56:53Z" level=info msg="Node ec9aaaef-96bf-4f35-bd69-ef951b0c3c23 with Index (252) is Up"
time="2017-07-18T10:56:58Z" level=warning msg="Unable to save new node configuration: RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: c48e8eaa-e142-428b-88cc-616e1b080c47"
PXPROCS: px daemon exited with code: 6
1532
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:56:58,389 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:56:59,392 INFO spawned: 'pxdaemon' with pid 1596
PXPROCS: Started px-storage with pid 1597
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:57:00,995 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1606
PXPROCS: Started watchdog with pid 1607
2017-07-18_10:57:02: PX-Watchdog: Starting watcher
2017-07-18_10:57:02: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1466-time1500375402
time="2017-07-18T10:57:02Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:57:02Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
2017-07-18_10:57:02: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:56:42.843407744 +0000 UTC.stack
2017-07-18_10:57:02: PX-Watchdog: Waiting for px process to start
root 1606 1596 0 10:57 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:57:02: PX-Watchdog: (pid 1606): Begin monitoring
time="2017-07-18T10:57:04Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: e83754da-f751-46f9-abba-f84d6484cae1). Trying attachSet..."
time="2017-07-18T10:57:04Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:57:04Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:57:04Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:57:04Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:57:04Z" level=info msg="New NodeID is 64f13526-0f47-4389-a169-30054dd761ad"
time="2017-07-18T10:57:04Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:57:04Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:57:04Z" level=info msg="Node 64f13526-0f47-4389-a169-30054dd761ad with Index (253) is Up"
time="2017-07-18T10:57:08Z" level=warning msg="Unable to save new node configuration: RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: dba20d27-9911-41f1-83bd-6b6571220d78"
PXPROCS: px daemon exited with code: 6
1597
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:57:08,304 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:57:08,382 INFO spawned: 'pxdaemon' with pid 1660
2017-07-18 10:57:08,383 INFO reaped unknown pid 1458
PXPROCS: Started px-storage with pid 1661
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:57:09,998 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1670
PXPROCS: Started watchdog with pid 1671
2017-07-18_10:57:11: PX-Watchdog: Starting watcher
2017-07-18_10:57:11: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1532-time1500375418
time="2017-07-18T10:57:11Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:57:11Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
2017-07-18_10:57:11: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:56:58.194271715 +0000 UTC.stack
2017-07-18_10:57:11: PX-Watchdog: Waiting for px process to start
root 1670 1660 0 10:57 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:57:11: PX-Watchdog: (pid 1670): Begin monitoring
time="2017-07-18T10:57:11Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 7a4d7db4-0fb4-4f60-aa9b-517d11dfb65b). Trying attachSet..."
time="2017-07-18T10:57:12Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:57:12Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:57:12Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:57:12Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:57:12Z" level=info msg="New NodeID is ea37d08f-2881-4e01-bc35-a6935911d8dd"
time="2017-07-18T10:57:12Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:57:12Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:57:12Z" level=info msg="Node ea37d08f-2881-4e01-bc35-a6935911d8dd with Index (254) is Up"
time="2017-07-18T10:57:18Z" level=warning msg="Unable to save new node configuration: RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: d00530ee-3a80-4927-b814-b936b810309b"
PXPROCS: px daemon exited with code: 6
1661
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:57:18,292 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:57:19,294 INFO spawned: 'pxdaemon' with pid 1725
PXPROCS: Started px-storage with pid 1726
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:57:20,801 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1738
PXPROCS: Started watchdog with pid 1739
2017-07-18_10:57:22: PX-Watchdog: Starting watcher
2017-07-18_10:57:22: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1597-time1500375428
2017-07-18_10:57:22: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:57:08.223482517 +0000 UTC.stack
2017-07-18_10:57:22: PX-Watchdog: Waiting for px process to start
time="2017-07-18T10:57:22Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:57:22Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
root 1738 1725 0 10:57 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:57:22: PX-Watchdog: (pid 1738): Begin monitoring
time="2017-07-18T10:57:22Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 6bddda3d-844b-4dd2-a2ec-035aa6f13fd4). Trying attachSet..."
time="2017-07-18T10:57:22Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:57:22Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:57:22Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:57:22Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:57:22Z" level=info msg="New NodeID is daeac1c7-0ef3-49be-bbc9-ce2eb087712e"
time="2017-07-18T10:57:22Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:57:22Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:57:23Z" level=info msg="Node daeac1c7-0ef3-49be-bbc9-ce2eb087712e with Index (255) is Up"
time="2017-07-18T10:57:28Z" level=warning msg="Unable to save new node configuration: RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: d9e66ea2-1259-413f-9896-444ade4dd17f"
PXPROCS: px daemon exited with code: 6
1726
PXPROCS: dumping core and killing px-storage...
2017-07-18 10:57:29,018 INFO exited: pxdaemon (exit status 6; not expected)
2017-07-18 10:57:30,020 INFO spawned: 'pxdaemon' with pid 1794
PXPROCS: Started px-storage with pid 1795
bash: connect: Connection refused
bash: /dev/tcp/localhost/9009: Connection refused
PXPROCS: px-storage not started yet...sleeping
C++ grpc server listening on 0.0.0.0:9009
2017-07-18 10:57:31,620 INFO success: pxdaemon entered RUNNING state, process has stayed up for > than 1 seconds (startsecs)
PXPROCS: Started px with pid 1804
PXPROCS: Started watchdog with pid 1805
2017-07-18_10:57:33: PX-Watchdog: Starting watcher
2017-07-18_10:57:33: PX-Watchdog: (pid 0): Removing intermediate px core file: core-px-storage-sig6-user0-group0-pid1661-time1500375438
2017-07-18_10:57:33: PX-Watchdog: (pid 0): Removing intermediate px core file: 2017-07-18 10:57:18.215382323 +0000 UTC.stack
2017-07-18_10:57:33: PX-Watchdog: Waiting for px process to start
time="2017-07-18T10:57:33Z" level=info msg="Registering [kernel] as a volume driver"
time="2017-07-18T10:57:33Z" level=info msg="Starting PX Version: 1.2.8-e70082e - Build Version e70082e281be8b71872b09a3304926438466fc5b"
root 1804 1794 0 10:57 ? 00:00:00 /usr/local/bin/px -daemon
2017-07-18_10:57:33: PX-Watchdog: (pid 1804): Begin monitoring
time="2017-07-18T10:57:36Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-002334b285798b4ac' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 5b2f3eb3-e5e8-4fe1-b707-3eadad7afaf3). Trying attachSet..."
time="2017-07-18T10:57:36Z" level=info msg="AttachSet while Load of private config failed: No sets available"
time="2017-07-18T10:57:36Z" level=info msg="Node is not yet initialized"
time="2017-07-18T10:57:36Z" level=error msg="Unsupported scheduler none given to storage driver"
time="2017-07-18T10:57:36Z" level=info msg="Generating a new Node ID and index."
time="2017-07-18T10:57:36Z" level=info msg="New NodeID is 90953e20-5f0b-4483-b4b4-abb9b43e0636"
time="2017-07-18T10:57:36Z" level=info msg="Joining cluster socialbase at this discovery service: [swarmmanager.sb:8500]."
time="2017-07-18T10:57:36Z" level=info msg="Alerts initialized successfully for this cluster"
time="2017-07-18T10:57:36Z" level=info msg="Node 90953e20-5f0b-4483-b4b4-abb9b43e0636 with Index (256) is Up"
2017-07-18 10:57:39,537 INFO reaped unknown pid 1525
time="2017-07-18T10:57:48Z" level=warning msg="AttachSet: attachVols failed with error (InvalidVolume.ZoneMismatch: The volume 'vol-031fddb2bffbc8d8d' is not in the same availability zone as instance 'i-07b3f498e4a2a2441'\n\tstatus code: 400, request id: 36bb811b-e847-4b66-bf2f-0e0d4c2b84d8). Trying attachSet..."
time="2017-07-18T10:57:48Z" level=info msg="Failed to attach volume set vol-09ed82552258e7f86 vol-031fddb2bffbc8d8d vol-0c40674bc3d576274 with error: No sets available. Retrying..."
2017-07-18 10:57:50,164 INFO reaped unknown pid 1589
time="2017-07-18T10:57:57Z" level=warning msg="AttachSet: attachVols failed with error (RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: 34d8b49e-9904-46ed-8dd5-cd1e304a22ae). Trying attachSet..."
time="2017-07-18T10:57:57Z" level=info msg="Failed to attach volume set vol-09ed82552258e7f86 vol-031fddb2bffbc8d8d vol-0c40674bc3d576274 with error: No sets available. Retrying..."
2017-07-18 10:58:05,591 INFO reaped unknown pid 1655
time="2017-07-18T10:58:06Z" level=warning msg="AttachSet: attachVols failed with error (RequestLimitExceeded: Request limit exceeded.\n\tstatus code: 503, request id: 2565b4ad-a0e0-4720-b451-d552069ae84c). Trying attachSet..."
time="2017-07-18T10:58:06Z" level=info msg="Failed to attach volume set vol-09ed82552258e7f86 vol-031fddb2bffbc8d8d vol-0c40674bc3d576274 with error: No sets available. Retrying..."
2017-07-18 10:58:10,654 WARN received SIGTERM indicating exit request
2017-07-18 10:58:10,654 INFO waiting for lttng, px-ns, exec, px_event_listener, relayd, pxdaemon to die
2017-07-18 10:58:10,655 INFO stopped: px_event_listener (terminated by SIGTERM)
2017-07-18 10:58:10,656 INFO stopped: px-ns (terminated by SIGTERM)
PXPROCS: Received SIGTERM. Exiting PX gracefully....
PXPROCS: trying to kill px daemon...
time="2017-07-18T10:58:10Z" level=info msg="Received signal 'terminated', starting shutdown..."
time="2017-07-18T10:58:10Z" level=info msg="Shutdown complete"
PXPROCS: px daemon exited with code: 0
1795
PXPROCS: trying to kill px-storage...
2017-07-18 10:58:10,713 INFO stopped: pxdaemon (exit status 0)
2017-07-18 10:58:10,714 INFO stopped: exec (terminated by SIGTERM)
Waiting for data availability
Tracing stopped for session pxd
Session pxd destroyed
Exiting lttng
2017-07-18 10:58:10,728 INFO stopped: lttng (exit status 0)
2017-07-18 10:58:10,857 INFO stopped: relayd (exit status 0)
root@ip-10-0-1-9:~# curl -s http://169.254.169.254/latest/meta-data/placement/availability-zone ; echo -e '\n'
sa-east-1a
@jvinod, I've make it works now using px-enterprise as a container and with only one disk in the config. My last attempt using this aproach not worked because I've forgot to remove the portworx section with all of the 3 template disks from the instance userdata.
Great thanks. We'll fix this so that you don't need to do this customization.
This is fixed in 1.2.10
I've setup a statefull ami accordig to this guide. In my setup, I've chosen to not use a systemd service startup script, instead, I'm just using
--restart=unless-stopped
from docker run parameter.I've created 3 vols in 3 different AZs: sa-east-1a, sa-east-b and sa-east-1c, each one with 30GB:
To create the px container in my statefull ami, I've did the bellow steps:
After that, I've created the ami and launched a new instance from it with this userdata:
After instance is running,
pxctl status
shows this:Checking the AWS console, I've noticed that portworx created a lot of 30GB disks in all 3 zones!
This is the container log. It seems that it entered in a looping state: