Seagate / cortx-re

CORTX Release Engineering Scripts and Dockerfiles.
https://github.com/Seagate/cortx
GNU Affero General Public License v3.0

CORTX-32909: Instructions to build cortx images on AWS (Multi-node) setup with Terraform #1156

Closed mukul-seagate11 closed 2 years ago

mukul-seagate11 commented 2 years ago

Problem Statement

Design

Coding

Checklist for Author

Testing

Checklist for Author

Impact Analysis

Checklist for Author/Reviewer/GateKeeper

Review Checklist

Checklist for Author

Documentation

Checklist for Author

mukul-seagate11 commented 2 years ago

Regarding the comment at https://github.com/Seagate/cortx-re/pull/1156#discussion_r941307408: this was already tested earlier, and the docker package was not being installed on the worker nodes. Based on that, I updated the setup.sh script.

mukul-seagate11 commented 2 years ago

ip-172-31-X-X:
--------------- SUCCESS --------------------
node/ip-172-31-X-X.ap-south-1.compute.internal labeled
--------------- SUCCESS -------------------------------------

node/ip-172-31-X-Y.ap-south-1.compute.internal labeled
--------------- SUCCESS -------------------------------------

mukul-seagate11 commented 2 years ago

https://github.com/Seagate/cortx-re/pull/1156#discussion_r941307408

When testing with the previous code, I still see that the docker package is not installed on the worker nodes:

[root@ip-172-31-XX-YY tmp]# ls |grep tar
cortximages.tar
[root@ip-172-31-XX-YY tmp]# for image in /tmp/*.tar; do cat $image | docker load; done
bash: docker: command not found

So I re-added the code to install docker.
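The failure above can be caught before the load loop runs. The following is a minimal sketch, not the code from setup.sh: the function name `load_image_tarballs` and its default directory are hypothetical, and it only assumes that the tarballs sit in one directory, as in the session above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of setup.sh): load every image tarball in a
# directory, but only when the docker CLI is available on this node.
load_image_tarballs() {
    local dir="${1:-/tmp}"
    local image

    # Fail early with a clear message instead of "command not found"
    # once per tarball.
    if ! command -v docker >/dev/null 2>&1; then
        echo "docker not found in PATH; install it before loading images" >&2
        return 1
    fi

    for image in "$dir"/*.tar; do
        [ -e "$image" ] || continue   # the glob matched nothing
        docker load -i "$image"
    done
}
```

Calling `load_image_tarballs /tmp` on a worker node would either load `cortximages.tar` or stop with a single error that points at the missing package.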

mukul-seagate11 commented 2 years ago

Please add the CORTX deployment steps used. Is there any modification required?

--------------- hctl status ---------------
Wed Aug 10 09:33:09 UTC 2022
Unable to use a TTY - input is not a terminal or the right kind of file
Unable to use a TTY - input is not a terminal or the right kind of file
Unable to use a TTY - input is not a terminal or the right kind of file
Bytecount:
    critical : 0
    damaged : 0
    degraded : 0
    healthy : 0
Data pool:
    fid name
    0x6f00000000000001:0x0 'storage-set-1__sns'
Profile:
    fid name: pool(s)
    0x7000000000000001:0x0 'Profile_the_pool': 'storage-set-1__sns' 'storage-set-1__dix' None
Services:
    cortx-data-g0-0.cortx-data-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x0 inet:tcp:cortx-data-g0-0.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0x1 inet:tcp:cortx-data-g0-0.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0x2 inet:tcp:cortx-data-g0-0.cortx-data-headless.default.svc.cluster.local@21002
    cortx-data-g0-1.cortx-data-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x3 inet:tcp:cortx-data-g0-1.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0x4 inet:tcp:cortx-data-g0-1.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0x5 inet:tcp:cortx-data-g0-1.cortx-data-headless.default.svc.cluster.local@21002
    cortx-data-g0-2.cortx-data-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x6 inet:tcp:cortx-data-g0-2.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0x7 inet:tcp:cortx-data-g0-2.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0x8 inet:tcp:cortx-data-g0-2.cortx-data-headless.default.svc.cluster.local@21002
    cortx-data-g1-0.cortx-data-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x9 inet:tcp:cortx-data-g1-0.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0xa inet:tcp:cortx-data-g1-0.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0xb inet:tcp:cortx-data-g1-0.cortx-data-headless.default.svc.cluster.local@21002
    cortx-data-g1-1.cortx-data-headless.default.svc.cluster.local (RC)
    [started] hax 0x7200000000000001:0xc inet:tcp:cortx-data-g1-1.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0xd inet:tcp:cortx-data-g1-1.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0xe inet:tcp:cortx-data-g1-1.cortx-data-headless.default.svc.cluster.local@21002
    cortx-data-g1-2.cortx-data-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0xf inet:tcp:cortx-data-g1-2.cortx-data-headless.default.svc.cluster.local@22001
    [started] ioservice 0x7200000000000001:0x10 inet:tcp:cortx-data-g1-2.cortx-data-headless.default.svc.cluster.local@21001
    [started] confd 0x7200000000000001:0x11 inet:tcp:cortx-data-g1-2.cortx-data-headless.default.svc.cluster.local@21002
    cortx-server-0.cortx-server-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x12 inet:tcp:cortx-server-0.cortx-server-headless.default.svc.cluster.local@22001
    [started] rgw_s3 0x7200000000000001:0x13 inet:tcp:cortx-server-0.cortx-server-headless.default.svc.cluster.local@22501
    cortx-server-1.cortx-server-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x14 inet:tcp:cortx-server-1.cortx-server-headless.default.svc.cluster.local@22001
    [started] rgw_s3 0x7200000000000001:0x15 inet:tcp:cortx-server-1.cortx-server-headless.default.svc.cluster.local@22501
    cortx-server-2.cortx-server-headless.default.svc.cluster.local
    [started] hax 0x7200000000000001:0x16 inet:tcp:cortx-server-2.cortx-server-headless.default.svc.cluster.local@22001
    [started] rgw_s3 0x7200000000000001:0x17 inet:tcp:cortx-server-2.cortx-server-headless.default.svc.cluster.local@22501
==================== Time taken for service to start 0 mins ====================

mukul-seagate11 commented 2 years ago

Please add the CORTX deployment steps used. Is there any modification required?

==================== POD Status ====================

NAME                            READY  STATUS   RESTARTS  AGE    IP              NODE                                          NOMINATED NODE  READINESS GATES
cortx-consul-client-pbz7w       1/1    Running  0         2m45s  192.168.203.68  ip-172-31-43-221.ap-south-1.compute.internal
cortx-consul-client-shgr4       1/1    Running  0         2m45s  192.168.235.4   ip-172-31-41-132.ap-south-1.compute.internal
cortx-consul-client-zzfvw       1/1    Running  0         2m45s  192.168.225.3   ip-172-31-37-204.ap-south-1.compute.internal
cortx-consul-server-0           1/1    Running  0         2m45s  192.168.225.5   ip-172-31-37-204.ap-south-1.compute.internal
cortx-consul-server-1           1/1    Running  0         2m44s  192.168.203.71  ip-172-31-43-221.ap-south-1.compute.internal
cortx-consul-server-2           1/1    Running  0         2m44s  192.168.235.9   ip-172-31-41-132.ap-south-1.compute.internal
cortx-control-6d5d4495bb-78qs5  1/1    Running  0         2m45s  192.168.235.5   ip-172-31-41-132.ap-south-1.compute.internal
cortx-data-g0-0                 3/3    Running  0         2m44s  192.168.225.11  ip-172-31-37-204.ap-south-1.compute.internal
cortx-data-g0-1                 3/3    Running  0         2m43s  192.168.235.16  ip-172-31-41-132.ap-south-1.compute.internal
cortx-data-g0-2                 3/3    Running  0         2m42s  192.168.203.80  ip-172-31-43-221.ap-south-1.compute.internal
cortx-data-g1-0                 3/3    Running  0         2m44s  192.168.225.10  ip-172-31-37-204.ap-south-1.compute.internal
cortx-data-g1-1                 3/3    Running  0         2m43s  192.168.235.13  ip-172-31-41-132.ap-south-1.compute.internal
cortx-data-g1-2                 3/3    Running  0         2m42s  192.168.203.79  ip-172-31-43-221.ap-south-1.compute.internal
cortx-ha-9f8846d85-9bkmb        3/3    Running  0         2m45s  192.168.235.6   ip-172-31-41-132.ap-south-1.compute.internal
cortx-kafka-0                   1/1    Running  0         2m43s  192.168.203.77  ip-172-31-43-221.ap-south-1.compute.internal
cortx-kafka-1                   1/1    Running  0         2m43s  192.168.225.13  ip-172-31-37-204.ap-south-1.compute.internal
cortx-kafka-2                   1/1    Running  0         2m42s  192.168.235.18  ip-172-31-41-132.ap-south-1.compute.internal
cortx-server-0                  2/2    Running  0         2m45s  192.168.225.7   ip-172-31-37-204.ap-south-1.compute.internal
cortx-server-1                  2/2    Running  0         2m44s  192.168.235.10  ip-172-31-41-132.ap-south-1.compute.internal
cortx-server-2                  2/2    Running  0         2m44s  192.168.203.76  ip-172-31-43-221.ap-south-1.compute.internal
cortx-zookeeper-0               1/1    Running  0         2m45s  192.168.225.6   ip-172-31-37-204.ap-south-1.compute.internal
cortx-zookeeper-1               1/1    Running  0         2m44s  192.168.203.72  ip-172-31-43-221.ap-south-1.compute.internal
cortx-zookeeper-2               1/1    Running  0         2m44s  192.168.235.14  ip-172-31-41-132.ap-south-1.compute.internal
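A pod listing like the one above can be checked mechanically rather than by eye. This is a rough sketch under stated assumptions: `not_ready_pods` is a hypothetical helper name, and it expects the whitespace-separated column layout of `kubectl get pods` on standard input, with a single header row.

```shell
#!/usr/bin/env bash
# Hypothetical helper: read `kubectl get pods`-style text on stdin and print
# the name of every pod that is not fully ready (READY is not n/n) or whose
# STATUS is not "Running". Prints nothing when all pods are healthy.
not_ready_pods() {
    awk 'NR > 1 && NF >= 3 {
        split($2, ready, "/")            # READY column, e.g. "3/3"
        if (ready[1] != ready[2] || $3 != "Running")
            print $1
    }'
}
```

For example, `kubectl get pods -o wide | not_ready_pods` would print nothing for the listing above, since every pod reports a full READY count and a Running status. Pods that finish normally (status Completed) would also be reported, so the filter fits long-running pods only.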


mukul-seagate11 commented 2 years ago

The build is working as expected, as shown by the output:

2: Pulling from library/registry
213ec9aee27d: Pull complete
5299e6f78605: Pull complete
4c2fb79b7ce6: Pull complete
74a97d2d84d9: Pull complete
44c4c74a95e4: Pull complete
Digest: sha256:83bb78d7b28f1ac99c68133af32c93e9a1c149bcd3cb6e683a3ee56e312f1c96
Status: Downloaded newer image for registry:2
e52b94577079c584e602e2572c9dd1560826d901b4c4209c931847aadd6baea0
Package jq-1.6-2.el7.x86_64 already installed and latest version
● docker.service - Docker Application Container Engine
   Loaded: loaded (/usr/lib/systemd/system/docker.service; enabled; vendor preset: disabled)
   Active: active (running) since Fri 2022-08-12 07:21:58 UTC; 15ms ago
     Docs: https://docs.docker.com
 Main PID: 31452 (dockerd)
    Tasks: 46
   Memory: 68.6M
   CGroup: /system.slice/docker.service
           ├─31452 /usr/bin/dockerd -H fd:// --containerd=/run/containerd/containerd.sock
           ├─31577 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 8080 -container-ip 172.17...
           ├─31585 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 8080 -container-ip 172.17.0.2 ...
           ├─31599 /usr/bin/docker-proxy -proto tcp -host-ip 0.0.0.0 -host-port 5000 -container-ip 172.17...
           └─31606 /usr/bin/docker-proxy -proto tcp -host-ip :: -host-port 5000 -container-ip 172.17.0.2 ...

Aug 12 07:21:56 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:56.65..."
Aug 12 07:21:56 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:56.91..."
Aug 12 07:21:56 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:56.97..."
Aug 12 07:21:57 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:57.21..."
Aug 12 07:21:57 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:57.34..."
Aug 12 07:21:57 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:57.97..."
Aug 12 07:21:57 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:57.99...7
Aug 12 07:21:57 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:57.99..."
Aug 12 07:21:58 ip-172-31-45-16.ap-south-1.compute.internal systemd[1]: Started Docker Application Conta....
Aug 12 07:21:58 ip-172-31-45-16.ap-south-1.compute.internal dockerd[31452]: time="2022-08-12T07:21:58.13..."
Hint: Some lines were ellipsized, use -l to show in full.

Pushing Docker image to GitHub Container Registry
The push refers to repository [ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-all]
33183e393d52: Pushed
1fb732fba1e6: Pushed
d6a698b755f0: Pushed
99197ca5e6a6: Pushed
99718c5858a2: Pushed
5b4d70426308: Pushed
007cd53edbb3: Pushed
739ab47ecbef: Pushed
271620902409: Pushed
d0ecbfefb88e: Pushed
185525670711: Pushed
2.0.0-0: digest: sha256:213a1946d34299a406b48ef3905e20d084ec464c244b13e944206f58eb2459ce size: 2637

Pushing Docker image to GitHub Container Registry
The push refers to repository [ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-rgw]
7e3614cc5082: Pushed
752c340083b7: Pushed
9633449eda72: Pushed
2d2b10e5d636: Pushed
ed4218753701: Pushed
22c03e72bf7a: Pushed
9b8f5aca104f: Pushed
6f2aef56c6e7: Pushed
3adc5eed0ffe: Pushed
d0ecbfefb88e: Mounted from seagate/cortx-all
185525670711: Mounted from seagate/cortx-all
2.0.0-0: digest: sha256:54cc5de9eef223da378d9e901df7a65142534744118c8d1cd1aa7267aedb210a size: 2637

Pushing Docker image to GitHub Container Registry
The push refers to repository [ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-data]
d88c2138d09a: Pushed
9687de09b2ff: Pushed
1904df840d21: Pushed
4e2cf6a05d0a: Pushed
161b03086236: Pushed
872aa1ff9f19: Pushed
f62463c7a4af: Pushed
d9cc82604c3e: Pushed
60dbca7e9a4f: Pushed
d0ecbfefb88e: Mounted from seagate/cortx-rgw
185525670711: Mounted from seagate/cortx-rgw
2.0.0-0: digest: sha256:107b54e0eec6010be59c0baeceaedaa09751480dfb73a08e65d0c3cb791d4f79 size: 2637

Pushing Docker image to GitHub Container Registry
The push refers to repository [ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-control]
3ceec9c758e7: Pushed
8c7876dea914: Pushed
da8ea8fc4cc2: Pushed
347feda2e8eb: Pushed
f70707a6e606: Pushed
e1e42f8eabdf: Pushed
d71f2d2c904e: Pushed
4778f19838f2: Pushed
a7370b2ab0f1: Pushed
d0ecbfefb88e: Mounted from seagate/cortx-data
185525670711: Mounted from seagate/cortx-data
2.0.0-0: digest: sha256:3ff72c01f55c128e1085958588a5de3b7a30f506117d420fd6bff0c5e24114a4 size: 2636

Latest tag creation skipped
Latest tag creation skipped
Latest tag creation skipped
Latest tag creation skipped

[root@ip-172-31-45-16 community-deploy]# docker images
REPOSITORY                                                              TAG             IMAGE ID      CREATED         SIZE
cortx-rgw                                                               2.0.0-0         2bed823e8579  9 minutes ago   1.47GB
ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-rgw      2.0.0-0         2bed823e8579  9 minutes ago   1.47GB
cortx-data                                                              2.0.0-0         497e1a6879ee  9 minutes ago   1.09GB
ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-data     2.0.0-0         497e1a6879ee  9 minutes ago   1.09GB
cortx-all                                                               2.0.0-0         9b2f8b8917a7  9 minutes ago   1.19GB
ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-all      2.0.0-0         9b2f8b8917a7  9 minutes ago   1.19GB
cortx-control                                                           2.0.0-0         3c18acfb5749  11 minutes ago  843MB
ip-172-31-45-16.ap-south-1.compute.internal:8080/seagate/cortx-control  2.0.0-0         3c18acfb5749  11 minutes ago  843MB
registry                                                                2               3a0f7b0a13ef  2 days ago      24.1MB
nginx                                                                   latest          b692a91e4e15  10 days ago     142MB
ghcr.io/seagate/cortx-build                                             rockylinux-8.4  d728841607b0  7 weeks ago     1.51GB

[root@ip-172-31-45-16 community-deploy]# docker ps
CONTAINER ID  IMAGE       COMMAND                 CREATED         STATUS         PORTS                                                                                 NAMES
e52b94577079  registry:2  "/entrypoint.sh /etc…"  36 minutes ago  Up 36 minutes  0.0.0.0:5000->5000/tcp, :::5000->5000/tcp, 0.0.0.0:8080->8080/tcp, :::8080->8080/tcp  local-registry
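
The image listing shows each locally built image (for example `cortx-rgw:2.0.0-0`) next to a copy with the same image ID under the local registry prefix. One way to get that result is a tag followed by a push. The sketch below is an illustration only and may differ from what the build script in this PR does; the function name `push_to_local_registry` and its argument order are hypothetical, while the `seagate/` namespace is taken from the output above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: retag a locally built image under a registry prefix
# and push it there. Arguments: <registry host:port> <image name> <tag>.
push_to_local_registry() {
    local registry="$1" name="$2" tag="$3"
    local target="${registry}/seagate/${name}:${tag}"

    # Stop at the first failing step so a bad tag is never pushed.
    docker tag "${name}:${tag}" "$target" || return 1
    docker push "$target"
}
```

With the values from the log, the call would look like `push_to_local_registry ip-172-31-45-16.ap-south-1.compute.internal:8080 cortx-rgw 2.0.0-0`.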