ClusterLabs / anvil

The Anvil! Intelligent Availability™ Platform, mark 3

drbd config not generated #341

Closed by fabbione 1 year ago

fabbione commented 1 year ago

In CI, the attempt to deploy server 1 fails.

On the machines, the DRBD config is missing node02's IP address:

[root@an-a01n01 ~]# lvs
  LV                VG            Attr       LSize  Pool Origin Data%  Meta%  Move Log Cpy%Sync Convert
  an-test-deploy1_0 anvil-test-vg -wi-a----- 10.00g

[root@an-a01n01 ~]# drbdadm status
drbd.d/an-test-deploy1.res:27: Parse error: 'ssocks | sdp | ipv4 | ipv6 | <ipv4 address> ' expected,
        but got ':'

[root@an-a01n01 ~]# cat /etc/drbd.d/an-test-deploy1.res
# Resource for an-test-deploy1
resource an-test-deploy1 {
        on an-a01n01 {
                node-id 0;
                volume 0 {
                        device    /dev/drbd_an-test-deploy1_0 minor 0;
                        disk      /dev/anvil-test-vg/an-test-deploy1_0;
                        meta-disk internal;
                }
        }

        on an-a01n02 {
                node-id 1;
                volume 0 {
                        device    /dev/drbd_an-test-deploy1_0 minor 0;
                        disk      /dev/anvil-test-vg/an-test-deploy1_0;
                        meta-disk internal;
                }
        }

        ### NOTE: Remember to open the appropriate firewall port!
        # firewall-cmd --zone=SN1 --permanent --add-port=7788/tcp --permanent
        # firewall-cmd --zone=SN1 --permanent --add-port=7788/tcp

        connection {
                host  an-a01n01  address  10.101.4.1:7788;
                host  an-a01n02  address  :7788;
                disk {
                        # Without this, the variable bit rate caps at 100 MiB/sec, and most deployments are
                        # 10 Gbps. So this lets the variable rate climb to 500 MiB/sec
                        c-max-rate 500M;
                }
                net {
                        protocol C;
                        verify-alg md5;
                        fencing resource-and-stonith;
                }
        }
}

Complete anvil logs available here for download: http://anvil-ci-repo.ci.alteeve.com/testing-logs/5721.tar.gz

fabbione commented 1 year ago

Logs that include etc/hosts http://anvil-ci-repo.ci.alteeve.com/testing-logs/drbd-failure.tar.gz

fabbione commented 1 year ago

This might actually be a bug in CI. I am verifying now.

fabbione commented 1 year ago

I am confident this was a bug in CI deployment configuration. Closing for now, I will reopen if it happens again.

digimer commented 1 year ago

PR #344 should catch failures like this and fail gracefully.
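
For illustration only, a check of the kind described here could look like the sketch below. It is not the code from the pull request mentioned above, and the function name `find_hosts_without_address` is hypothetical. It assumes the `host <name> address <ip>:<port>;` layout shown in the resource file in this issue and only handles plain IPv4 entries.

```python
import re

# Matches DRBD connection lines of the form: host <name> address [<ip>]:<port>;
# The address group may be empty, which is the failure seen in this issue.
# Assumption: plain IPv4 entries only, as in the resource file quoted above.
HOST_LINE = re.compile(r"^\s*host\s+(\S+)\s+address\s+(\S*):(\d+)\s*;")


def find_hosts_without_address(res_text):
    """Return (line_number, host_name) for each host line with an empty address."""
    missing = []
    for line_number, line in enumerate(res_text.splitlines(), start=1):
        match = HOST_LINE.match(line)
        if match and not match.group(2):
            missing.append((line_number, match.group(1)))
    return missing


# Connection lines copied from the resource file in this issue.
sample = """\
connection {
        host  an-a01n01  address  10.101.4.1:7788;
        host  an-a01n02  address  :7788;
}
"""

print(find_hosts_without_address(sample))  # [(3, 'an-a01n02')]
```

Running a check like this before the file is written would let the deployment stop with a clear message instead of leaving a resource file that `drbdadm` cannot parse.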

fabbione commented 1 year ago

No syntax error now. I double-checked that CI is NOT touching /etc/hosts. DRBD is not configured (the LVs are).

https://ci.kronosnet.org/job/anvil-build-functional-testing/5797/anvil-build-all-voting-functional-testing=anvil-bm/console

fabbione commented 1 year ago

I haven't seen this happen in a while now. I will reopen if it happens again.

fabbione commented 1 year ago

I jinxed it :P

http://anvil-ci-repo.ci.alteeve.com/testing-logs/5837.tar.gz

digimer commented 1 year ago

I think I found the source; working on a solution now.

digimer commented 1 year ago

@fabbione, is this clear to close?

fabbione commented 1 year ago

Yes, we can close.