Open fifofonix opened 3 weeks ago
If your node is in this bad state what happens if you reboot and then don't run docker ps
before trying to run podman
?
Podman bridge networking will work again post the reboot as expected.
Transcript for this on a newly provisioned node.
Fedora CoreOS 41.20241024.1.0
Tracker: https://github.com/coreos/fedora-coreos-tracker
Discuss: https://discussion.fedoraproject.org/tag/coreos
core@d-mini-1:~$ docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
core@d-mini-1:~$ sudo podman run -it fedora:41 curl google.com
Resolved "fedora" as an alias (/etc/containers/registries.conf.d/000-shortnames.conf)
Trying to pull registry.fedoraproject.org/fedora:41...
Getting image source signatures
Copying blob 7dc6d0552bc8 done |
Copying config a7e63d9477 done |
Writing manifest to image destination
curl: (6) Could not resolve host: google.com
core@d-mini-1:~$ sudo systemctl reboot
Broadcast message from root@d-mini-1 on pts/1 (Tue 2024-10-29 12:40:40 UTC):
The system will reboot now!
core@d-mini-1:~$ Shared connection to 172.25.81.77 closed.
(pdm-3.12) me@MY-MAC:~/.ssh$ ssh 172.25.81.77
Fedora CoreOS 41.20241024.1.0
Tracker: https://github.com/coreos/fedora-coreos-tracker
Discuss: https://discussion.fedoraproject.org/tag/coreos
Last login: Tue Oct 29 12:39:31 2024 from 156.111.209.171
core@d-mini-1:~$ sudo podman run -it fedora:41 curl google.com
<HTML><HEAD><meta http-equiv="content-type" content="text/html;charset=utf-8">
<TITLE>301 Moved</TITLE></HEAD><BODY>
<H1>301 Moved</H1>
The document has moved
<A HREF="http://www.google.com/">here</A>.
</BODY></HTML>
core@d-mini-1:~$
I should also note that I am running several long-lived next
nodes that run podman/docker concurrently without issue, so maybe some legacy settings carried forward in older nodes when they upgrade works fine, but that newly provisioned nodes defaulting to some newer networking aspects are having the issue...
@fifofonix considering it's against recommendation and also corrects itself on reboot I'm not sure there is much for us to do here other than characterize the problem so if others encounter it we can point them here.
Do you agree?
I think because it is against recommendation just characterizing the problem is fine.
Correcting the problem on reboot is only until someone uses docker. I can anticipate that a docker user seeking to experiment with podman for a oneshot systemd unit, on one of their docker nodes, and not realising that parallel use is not recommended, is going to end up confused and give up on their experiment.
If there were some more details on how to separate namespaces that would be useful to me.
Personally, I used podman to run some higher security workloads with additional capabilities that I did not want to expose to the docker process. For now these dual-workload nodes run fine but only until I choose to reprovision them. I will have to relocate worklods which is not an issue now that I know the issue at hand.
Describe the bug
Podman bridge network created for a new container does not operate successfully IF the node has first run a docker command (as simple as
docker ps
). Can't even access DNS servers.Note: Concurrent docker/podman usage is not recommended per FCOS FAQ for several years. However, up until now operations as simple as this have succeeded allowing users in most cases to use both in parallel (up to current
test
)Reproduction steps
next
node on any platform VMWare/AWS.sudo podman run -it fedora:41 curl -v google.com
# Succeedsdocker ps
activating docker and possibly triggering some networking operation of some sortsudo podman run -it fedora:41 curl -v google.com
# FailsExpected behavior
Podman container can access the internet
Actual behavior
Podman container can not even access DNS servers.
System details
Butane or Ignition config
Additional information
Non-working output: