Open c0debreaker opened 5 years ago
This is probably because some collectors didn't return and kept fds open. Do you have an NFS mount on that system? I suspect this is the same issue as #244.
Yes, we do have NFS mounted. I also noticed that our open file descriptor limit is 1024. Could this number be the cause? I changed it to 65536 for now, but I am not sure whether that will fix it. So far, it's still working and I haven't noticed Jenkins freezing.
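To check whether a process is actually approaching its descriptor limit, its open descriptors can be compared with the soft limit it is running under. The following is only a minimal sketch, assuming a Linux host with /proc mounted; the helper name fd_usage and its proc_root parameter are hypothetical and not part of node_exporter or Jenkins.

```python
import os

def fd_usage(pid, proc_root="/proc"):
    """Return (open_fds, soft_limit) for a process, read from procfs (Linux only).

    soft_limit is None when the limit is "unlimited" or the line is missing.
    """
    base = os.path.join(proc_root, str(pid))
    # Each entry in <proc_root>/<pid>/fd is one open file descriptor.
    open_fds = len(os.listdir(os.path.join(base, "fd")))
    soft_limit = None
    with open(os.path.join(base, "limits")) as limits:
        for line in limits:
            if line.startswith("Max open files"):
                # Fields after the three-word name: soft limit, hard limit, units.
                value = line.split()[3]
                soft_limit = None if value == "unlimited" else int(value)
                break
    return open_fds, soft_limit
```

Calling fd_usage with the PID of the Jenkins or node_exporter process (as root, or as the user that owns the process) shows how close that process is to the limit it was started with. Note that changing the limit in a shell does not affect processes that are already running.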
Today, we had a Jenkins outage. During the outage, I checked /var/log/messages; it had tons of these messages and was updating continuously.
Then I ran lsof and sorted the output to find the processes with the most open files.
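A similar ranking can be obtained without lsof by counting the entries under each process's fd directory in /proc. This is a rough sketch under the assumption of a Linux host; the function name top_fd_holders and its parameters are hypothetical, and it counts only descriptors, whereas lsof also lists memory-mapped files and other entries, so the numbers will not match lsof exactly.

```python
import os

def top_fd_holders(proc_root="/proc", top=10):
    """Return up to `top` tuples (fd_count, pid, command), highest count first."""
    counts = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        try:
            fd_count = len(os.listdir(os.path.join(proc_root, entry, "fd")))
            with open(os.path.join(proc_root, entry, "comm")) as comm:
                name = comm.read().strip()
        except OSError:
            # The process exited in the meantime, or we lack permission to inspect it.
            continue
        counts.append((fd_count, int(entry), name))
    counts.sort(reverse=True)
    return counts[:top]
```

Run as root to see every process; as an unprivileged user, processes owned by other users are silently skipped.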
We're using AWS EFS. I was thinking that our NFS mount went away. However, running cat /proc/mounts still showed it was mounted. I could still netcat to port 2049. However, I was unable to run ls /var/lib/jenkins. When I stopped the jenkins service, that was the only time I was able to access /var/lib/jenkins.
Kernel is
4.16.11-100.fc26.x86_64 #1 SMP Tue May 22 20:02:12 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
Output of cat /proc/version is
node_exporter is
ulimit -a
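The state described above, where /proc/mounts still lists the mount and port 2049 is reachable but ls on the path blocks, can be probed by running a stat in a child process with a timeout, so the caller itself never blocks on the mount. This is only a sketch under assumptions: the function name mount_responds and the 5-second default are hypothetical, and a child stuck on a hung NFS call may linger after the timeout, which is why the code does not wait for it again.

```python
import subprocess
import sys

def mount_responds(path, timeout=5.0):
    """Return True if a stat of `path` succeeds within `timeout` seconds."""
    # Stat the path in a separate interpreter so a hung mount blocks the child, not us.
    proc = subprocess.Popen(
        [sys.executable, "-c", "import os, sys; os.stat(sys.argv[1])", path],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    try:
        proc.wait(timeout=timeout)
    except subprocess.TimeoutExpired:
        # Ask the kernel to kill the child, but do not wait for it:
        # a process blocked on a hung NFS operation may not exit promptly.
        proc.kill()
        return False
    # Exit status 0 means os.stat succeeded; anything else means it raised.
    return proc.returncode == 0
```

For the situation in this report, mount_responds("/var/lib/jenkins") returning False during an outage would suggest the mount is hung or the path is missing rather than unmounted cleanly.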