Closed smith closed 12 years ago
Are you also defining override_ip? If not, try defining override_ip to be the IP address of the spoofed name.
Nathan,
perhaps you can give me the actual names on the IRC channels. I should be able to replicate this.
@jbuchbinder override_ip is defined. Here's the globals section:
globals {
daemonize = yes
setuid = yes
user = ganglia
debug_level = 10
max_udp_msg_len = 1472
mute = no
deaf = no
host_dmax = 0 /*secs */
cleanup_threshold = 300 /*secs */
gexec = no
send_metadata_interval = 60
override_hostname = web02.example.com
override_ip = 10.4.85.168
}
@vvuksan I sent you the domain name through the email on your website.
@vvuksan That email didn't go through. The domains it was failing on were web02.who.is and web03.who.is
We recently committed a change to use an APR method rather than the manual string concatenation we had been doing before, so that should fix this. Could someone confirm?
I've got this installed on a server, and FYI when using the passenger module from gmond-python-modules it fails on startup with:
{'status': 'sudo /usr/bin/passenger-status', 'memory_stats': 'sudo /usr/bin/passenger-memory-stats', 'metrix_prefix': 'passenger'}
apr_pollset_create failed: Invalid argument
That could be a bug in the python module, so I disabled it, and then I get:
apr_pollset_create failed: Invalid argument
after loading the python module. This could be a problem with some module or configuration (I ran it with sudo /usr/local/sbin/gmond -d 2 --conf=/etc/ganglia/gmond.conf
and installed the new version with --prefix=/usr/local
.
I'll let you know if I can try it on a non-production machine with no extra modules. Let me know if there is anything else I can do to help you confirm.
Are you sure you have valid UDP and/or TCP listeners defined on that instance?
http://www.mail-archive.com/ganglia-developers@lists.sourceforge.net/msg05559.html
Thanks for that. I set set it to deaf. I'm running the commit for #49 shown above and it seems to fix the problem! Thanks
On a node using
override_hostname = web02.example.com
(not the real domain; I can give that to you if needed), gmond crashes after running for a short time with a message like this:There are instances with names manager00.example.com, ops00.example.com, worker00.example.com, worker01.example.com, web01.example.com, web02.example.com, web03.example.com.
It only fails on web02 and web03, which are identical in every way except the name to web00 and web01.
(It crashes immediately if I include configuration for python modules btw)
Running on an EC2 m1.large instance on Ubuntu 10.04.4 LTS, gmond 3.3.8 (from the package at https://launchpad.net/~rufustfirefly/+archive/ganglia)