Open flo-renaud opened 2 years ago
The only recent change to the sasl code was in:
https://github.com/389ds/389-ds-base/pull/5167
Which fixed a memory leak, but perhaps it is related to this. There is also a new F36 DS build in testing: 389-ds-base-2.1.3
https://koji.fedoraproject.org/koji/taskinfo?taskID=84603435
https://bodhi.fedoraproject.org/updates/FEDORA-2022-917bd01497
It would be good to know if the latest version is also randomly failing.
@mreynolds389 we will keep you updated if this random failure is seen with the new DS build.
Issue Description FreeIPA nightly tests randomly fail trying to setup the replication. See for instance this test report with the following logs.
Package Version and Platform:
Steps to Reproduce Steps to reproduce the behavior:
ipa-server-install --domain ipa.test --realm IPA.TEST -a Secret123 -p Secret123 --setup-dns --auto-forwarders --auto-reverse -U
ipa-client-install --domain ipa.test --realm IPA.TEST -p admin -w Secret123 --server server.ipa.test -U
kinit admin; ipa-replica-install -U
The replica installation fails randomly.Expected results Replica installation should succeed.
Initial investigation The replica installation fails in the step setting up initial replication, with the following error:
The replica installer performs the following steps:
create a connection to the master, bind as
fqdn=replica0.ipa.test,cn=computers,cn=accounts,dc=ipa,dc=test
fetch nsDS5ReplicaId from the master
increment and update the value on the master
add replica config on the replica in
cn=replica,cn=dc\3Dipa\2Cdc\3Dtest,cn=mapping tree,cn=config
set changelog maxage to 30d on the replica
on the replica, create a special user to let SASL mapping find a valid user on first replication:
cn=ldap/master.ipa.test@IPA.TEST,cn=config
and add this user to nsDS5ReplicaBindDNon the replica, create a SASL mapping
cn=Peer Master,cn=mapping,cn=sasl,cn=config
This will map a kerberos principal
ldap/master.ipa.test@IPA.TEST
to the entrycn=ldap/master.ipa.test@IPA.TEST,cn=config
.add replica config on the master in
cn=replica,cn=dc\3Dipa\2Cdc\3Dtest,cn=mapping tree,cn=config
set changelog maxage to 30d on the master
on the master, make sure the group
cn=replication managers,cn=sysaccounts,cn=etc,dc=ipa,dc=test
exists and contains the principals for master and replicacreate repl agreement on the master
cn=meToreplica0.ipa.test,cn=replica,cn=dc\=ipa\,dc\=test,cn=mapping tree,cn=config
create repl agreement on the replica
cn=meTomaster.ipa.test,cn=replica,cn=dc\=ipa\,dc\=test,cn=mapping tree,cn=config
start replication by setting
nsds5BeginReplicaRefresh=start
on the master (entrycn=meToreplica0.ipa.test,...
)read the entry and check if the replication has started. This fails. The audit logs show the MOD operation is happening at 20220704162708. After that, a search is done every second to check the replication status but the replication fails to start. The master's error log shows the following error:
and the replica access log shows the master trying to connect but failing:
The connection should be authenticated as
cn=ldap/master.ipa.test@IPA.TEST,cn=config
as the master is using its kerberos principal. It seems that the SASL mapping is not working.Tracker on freeipa side: https://pagure.io/freeipa/issue/9196