389ds / 389-ds-base

The enterprise-class Open Source LDAP server for Linux
https://www.port389.org/
Other
212 stars 91 forks source link

Random failure in ipa-replica-install while setting up the replication #5361

Open flo-renaud opened 2 years ago

flo-renaud commented 2 years ago

Issue Description FreeIPA nightly tests randomly fail trying to setup the replication. See for instance this test report with the following logs.

Package Version and Platform:

Steps to Reproduce Steps to reproduce the behavior:

  1. on the master, install ipa server with ipa-server-install --domain ipa.test --realm IPA.TEST -a Secret123 -p Secret123 --setup-dns --auto-forwarders --auto-reverse -U
  2. on the replica, install an ipa client with ipa-client-install --domain ipa.test --realm IPA.TEST -p admin -w Secret123 --server server.ipa.test -U
  3. on the replica, promote the machine as a replica with kinit admin; ipa-replica-install -U The replica installation fails randomly.

Expected results Replica installation should succeed.

Initial investigation The replica installation fails in the step setting up initial replication, with the following error:

...
  [27/42]: creating DS keytab
  [28/42]: ignore time skew for initial replication
  [29/42]: setting up initial replication
Starting replication, please wait until this has completed.

Update in progress, 1 seconds elapsed
Update in progress, 2 seconds elapsed
Update in progress, 3 seconds elapsed
Update in progress, 4 seconds elapsed
Update in progress, 5 seconds elapsed
Update in progress, 6 seconds elapsed
Update in progress, 7 seconds elapsed
Update in progress, 8 seconds elapsed
Update in progress, 9 seconds elapsed
Update in progress, 10 seconds elapsed
Update in progress, 11 seconds elapsed
Update in progress, 12 seconds elapsed
Update in progress, 13 seconds elapsed
Update in progress, 14 seconds elapsed
Update in progress, 15 seconds elapsed
[ldap://master.ipa.test:389] reports: Update failed! Status: [Error (49) - LDAP error: Invalid credentials - no response received]

  [error] RuntimeError: Failed to start replication
Failed to start replication
The ipa-replica-install command failed. See /var/log/ipareplica-install.log for more information
Your system may be partly configured.
Run /usr/sbin/ipa-server-install --uninstall to clean up.

The replica installer performs the following steps:

Tracker on freeipa side: https://pagure.io/freeipa/issue/9196

mreynolds389 commented 2 years ago

The only recent change to the sasl code was in:

https://github.com/389ds/389-ds-base/pull/5167

Which fixed a memory leak, but perhaps it is related to this. There is also a new F36 DS build in testing: 389-ds-base-2.1.3

https://koji.fedoraproject.org/koji/taskinfo?taskID=84603435

https://bodhi.fedoraproject.org/updates/FEDORA-2022-917bd01497

It would be good to know if the latest version is also randomly failing.

flo-renaud commented 2 years ago

@mreynolds389 we will keep you updated if this random failure is seen with the new DS build.