Closed rushabh268 closed 4 years ago
Can you test this PR? https://github.com/openshift/openshift-ansible/pull/4991
@sdodson That PR won't handle adding -v /opt/cni/bin:/opt/cni/bin -v /etc/cni/net.d:/etc/cni/net.d
to the node.service file. I have tested that PR using the above workaround that I mentioned in the bug, @rparulek & @vareti are aware about this issue.
@sdodson We did try using our 4991 PR on atomic hosts but it fails at CNI pod resolution with the following error:
Error syncing pod, skipping: failed to "SetupNetwork" for "router-3-deploy_default" with SetupNetworkError: "Failed to setup network for pod "router-3-deploy_default(799d976f-1502-11e7-b3c1-fa163e10ef43)" using network plugins "cni": failed to send CNI request: Post http://dummy/: dial unix /var/run/openshift-sdn/cni-server.sock: connect: no such file or directory
This is the reason we used the workaround that was mentioned for Calico in this issue : https://github.com/openshift/openshift-ansible/issues/3805
The work around was we had to add the following host to container mappings in the file "/etc/systemd/system/docker.service.wants/atomic-openshift-node.service" on all our atomic nodes manually:
-v /var/usr/share/vsp-openshift:/var/usr/share/vsp-openshift -v /etc/default:/etc/default -v /var/run:/var/run -v /opt/cni/bin:/opt/cni/bin -v /etc/cni/net.d:/etc/cni/net.d
Are we missing anything here? Do we need to use some other method of exposing the cni bin/net directories?
@sdodson If we want to achieve the mounting we want on atomic nodes to get Nuage CNI working on Atomic hosts; is there a way we can pass additional Nuage needed docker mounts using "$DOCKER_ADDTL_BIND_MOUNTS" parameter somehow via openshift-ansible in line https://github.com/openshift/openshift-ansible/blob/master/roles/openshift_node/templates/openshift.docker.node.service#L24. Will that be possible to achieve ?
FYI, We are currently using an older openshift-ansible tag (openshift-ansible-3.6.128-1), wherein we are adding the above mentioned Nuage specific mounts in file "/etc/systemd/system/docker.service.wants/atomic-openshift-node.service" on all atomic hosts. I assume this file corresponds to file https://github.com/openshift/openshift-ansible/blob/master/roles/openshift_node/templates/openshift.docker.node.service on openshift-ansible master branch today, right?
@sdodson @rushabh268 I have created an upstream PR https://github.com/openshift/openshift-ansible/pull/5220 to handle the adding of the custom Nuage docker mounts to atomic-openshift-node service during the Nuage installation time itself.
@sdodson The PR I had created above did not seem to fix the issue wherein we like to add our custom Nuage docker mounts for the CNI plugin to be functional. It is because there is another openshift dep service file which sets the $DOCKER_ADDTL_BIND_MOUNTS environment variable as here : https://github.com/openshift/openshift-ansible/blob/4338dce09dbe5497f2a3700992eb4c5afeb4e6f6/roles/openshift_node/templates/openshift.docker.node.dep.service#L9 .
Is there a way you suggest this can be handled in openshift-ansible for adding these extra mounts for atomic-openshift-node.service? Any pointers in this context will be greatly appreciated!
Many Thanks!
Issues go stale after 90d of inactivity.
Mark the issue as fresh by commenting /remove-lifecycle stale
.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen
.
If this issue is safe to close now please do so with /close
.
/lifecycle stale
Stale issues rot after 30d of inactivity.
Mark the issue as fresh by commenting /remove-lifecycle rotten
.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen
.
If this issue is safe to close now please do so with /close
.
/lifecycle rotten /remove-lifecycle stale
Rotten issues close after 30d of inactivity.
Reopen the issue by commenting /reopen
.
Mark the issue as fresh by commenting /remove-lifecycle rotten
.
Exclude this issue from closing again by commenting /lifecycle frozen
.
/close
@openshift-bot: Closing this issue.
Description
Nuage CNI fails to setup network for any pod with Openshift Enterprise on Atomic
Version
Please put the following version information in the code block indicated below.
ansible --version
If you're operating from a git clone:
git describe
If you're running from playbooks installed via RPM or
atomic-openshift-utils
rpm -q atomic-openshift-utils openshift-ansible
Steps To Reproduce
Expected Results
Describe what you expected to happen.
Pod should be resolved in the CNI network but instead fails with the error shown below
Observed Results
Openshift is not able to invoke the Nuage CNI plugin due to the following error:
Additional Information
Provide any additional information which may help us diagnose the issue.
$ cat /etc/redhat-release
)[root@ovs-1 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Atomic Host release 7.3
[root@ovs-12 openshift-ansible]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 7.4 (Maipo)
Your inventory file (especially any non-standard configuration parameters)
Work-around: