Open kyuferev opened 5 years ago
I've just caught another UPS getting disconnected with "short read from UPS" errors in syslog. But this time the UPS came back after upsdrvctl start upsname, with two "Communications with UPS lost: Receive error (Requested only mode command): 4!!!" errors. And a usbhid-ups device failed with "Can't connect to UPS [ups-8] (usbhid-ups-ups-8): No such file or directory". upsdrvctl output:
root@upsmon:~# upsc ups-8
Init SSL without certificate database
Error: Driver not connected
root@upsmon:~# upsdrvctl start ups-8
Network UPS Tools - UPS driver controller 2.7.2
Network UPS Tools - Generic HID driver 0.38 (2.7.2)
USB communication driver 0.32
No matching HID UPS found
Driver failed to start (exit status=1)
Network UPS Tools - Generic HID driver 0.38 (2.7.2)
USB communication driver 0.32
No matching HID UPS found
Driver failed to start (exit status=1)
Network UPS Tools - Generic HID driver 0.38 (2.7.2)
USB communication driver 0.32
No matching HID UPS found
Driver failed to start (exit status=1)
Hi
There is no specific limit on the NUT side, but USB may be drawing too much power if your hubs are not self-powered.
Try lowering pollinterval in ups.conf. The UPS may be flooded. Also check the battery test period. Look at MAXAGE in upsd.conf and possibly upsmon.conf.
Cheers
Btw, some hard reset of the units may be required to get back to a sane situation. Power down, unplug the power cable, count to 10, ...
1. There is no specific limit on the NUT side, but USB may be drawing too much power if your hubs are not self-powered.
All hubs are self-powered, so that isn't the problem.
2. Try lowering pollinterval in ups.conf. The UPS may be flooded. Also check the battery test period. Look at MAXAGE in upsd.conf and possibly upsmon.conf.
But if I lower the pollinterval value, even more requests will be sent to the UPS, or am I getting it wrong? Right now I have pollinterval = 10 in ups.conf and MAXAGE 25 in upsd.conf. I've tried to fix the problem by modifying these parameters, but with no success. Should I increase the MAXAGE parameter even more?
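To make the relation between the two settings easier to see, here is a minimal sketch that reads both values and reports how many driver poll cycles fit inside MAXAGE. The function name polls_per_maxage is hypothetical, the config paths are passed in because they differ between installs, and the fallback values (2 and 15) are what I understand the NUT defaults to be, so treat them as assumptions.

```shell
#!/bin/sh
# Hypothetical helper (not part of NUT): report how many poll cycles fit in MAXAGE.
# $1 = path to ups.conf, $2 = path to upsd.conf
polls_per_maxage() {
    ups_conf=$1
    upsd_conf=$2
    # First "pollinterval = N" line in ups.conf (global or per-UPS).
    poll=$(sed -n 's/^[[:space:]]*pollinterval[[:space:]]*=[[:space:]]*\([0-9][0-9]*\).*/\1/p' "$ups_conf" | head -n 1)
    # First "MAXAGE N" line in upsd.conf.
    maxage=$(sed -n 's/^[[:space:]]*MAXAGE[[:space:]][[:space:]]*\([0-9][0-9]*\).*/\1/p' "$upsd_conf" | head -n 1)
    # Assumed defaults when a setting is absent.
    poll=${poll:-2}
    maxage=${maxage:-15}
    # Guard against a zero interval before dividing.
    [ "$poll" -gt 0 ] || poll=2
    echo "pollinterval=$poll MAXAGE=$maxage polls_per_maxage=$((maxage / poll))"
}
```

With pollinterval = 10 and MAXAGE 25, only two poll cycles fit before upsd reports the data as stale, so a couple of slow or failed reads in a row is enough to trigger the "stale" message.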
I should also note that I have zabbix-agent installed on this board, and it is the one that collects data from NUT. There are a lot (100+) of custom parameters configured and they are polled every 15 seconds.
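As far as I understand, upsc talks to upsd rather than to the UPS itself, so these Zabbix checks should not add traffic on the serial or USB link, but 100+ separate upsc calls every 15 seconds still load the board. A minimal sketch of one way to cut that down: dump all variables of a UPS into a cache file with one query, then serve single values from the file. The function names and the cache path are hypothetical, not part of NUT or Zabbix.

```shell
#!/bin/sh
# Hypothetical helpers (not part of NUT or Zabbix).

# Dump every variable of one UPS into a cache file with a single upsd query.
# $1 = UPS name as known to upsd, $2 = cache file path
refresh_ups_cache() {
    ups=$1
    cache=$2
    tmp="$cache.tmp.$$"
    if upsc "$ups" > "$tmp" 2>/dev/null; then
        # Rename so that readers never see a half-written file.
        mv "$tmp" "$cache"
    else
        rm -f "$tmp"
        return 1
    fi
}

# Print the value of one variable from the cache ("name: value" lines).
# $1 = cache file path, $2 = variable name; exits non-zero if not found.
cached_ups_var() {
    cache=$1
    var=$2
    awk -v key="$var" -F': ' '
        $1 == key { print substr($0, length(key) + 3); found = 1; exit }
        END { exit !found }
    ' "$cache"
}
```

refresh_ups_cache could run from cron or a timer once per polling cycle, and each Zabbix user parameter would then call cached_ups_var with the cache file and a variable name such as battery.charge.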
Btw, some hard reset of the units may be required to get back to a sane situation. Power down, unplug the power cable, count to 10, ...
Unfortunately that's impossible because of the large number of servers that are connected to these UPSes.
UPS got disconnected again. No errors in dmesg, all /dev/ttyUSB devices are in place. In syslog:
bcmxcp[1450]: Communications with UPS lost: Error executing command
bcmxcp[1450]: Short read from UPS
upsd[1452]: Data for UPS [ups-12] is stale - check driver
No response from UPS:
~# upsdrvctl -DDD start ups-12
Network UPS Tools - UPS driver controller 2.7.2
0.000000
0.004203 Starting UPS: ups-12
0.004715 3 remaining attempts
0.005061 exec: /lib/nut/bcmxcp -a ups-12
Network UPS Tools - BCMXCP UPS driver 0.28 (2.7.2)
RS-232 communication subdriver 0.20
No response from UPS on /dev/ttyUSB6 with baudrate 9600
Attempting to autodect baudrate
Can't connect to the UPS on port /dev/ttyUSB6!
44.629455 Driver failed to start (exit status=1)
49.629867 2 remaining attempts
49.630052 exec: /lib/nut/bcmxcp -a ups-12
Network UPS Tools - BCMXCP UPS driver 0.28 (2.7.2)
RS-232 communication subdriver 0.20
No response from UPS on /dev/ttyUSB6 with baudrate 9600
Attempting to autodect baudrate
Can't connect to the UPS on port /dev/ttyUSB6!
94.259703 Driver failed to start (exit status=1)
99.260083 1 remaining attempts
99.260269 exec: /lib/nut/bcmxcp -a ups-12
Network UPS Tools - BCMXCP UPS driver 0.28 (2.7.2)
RS-232 communication subdriver 0.20
No response from UPS on /dev/ttyUSB6 with baudrate 9600
Attempting to autodect baudrate
Can't connect to the UPS on port /dev/ttyUSB6!
143.889160 Driver failed to start (exit status=1)
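To see which driver instances fail and how often, the syslog lines shown above can be tallied per process. This is only a sketch: the function name count_comm_errors is hypothetical, and it assumes each syslog line carries a "driver[pid]:" tag like the bcmxcp[1450]: lines above.

```shell
#!/bin/sh
# Hypothetical helper: count communication errors per "driver[pid]" tag.
# Reads syslog lines on stdin, prints "<count> <driver[pid]>", highest count first.
count_comm_errors() {
    awk '
    /Communications with UPS lost|Short read from UPS/ {
        # Find the "name[pid]:" field regardless of any timestamp/host prefix.
        for (i = 1; i <= NF; i++) {
            if ($i ~ /^[A-Za-z0-9_-]+\[[0-9]+\]:$/) {
                tag = $i
                sub(/:$/, "", tag)
                count[tag]++
                break
            }
        }
    }
    END { for (t in count) print count[t], t }
    ' | sort -rn
}
```

For example, count_comm_errors < /var/log/syslog (the log path is an assumption and depends on the distribution) gives one line per driver process, which helps show whether the failures cluster on one hub or spread across all bcmxcp devices.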
Hi there. Pre-history: #597. I have a setup of one OrangePi One board with 3 USB hubs (each hub has its own PSU) connected to each other and 17 UPSes to monitor. Most of them are Eaton PW9120 6000i and there are also some PW9130. The PW9120 units are connected to the USB hubs via RS232-USB cables based on the PL2303 chip and use the bcmxcp driver. The PW9130 units are connected via USB A to USB B cables and use the usbhid-ups driver. I'm using a custom shell script to regenerate ups.conf after each reboot, and I also have zabbix-agent installed with a bunch of user parameters. I have two major problems with this setup:
to a syslog. It always starts with one UPS (not always the same one), but some time after the first failure other bcmxcp UPSes start to fail too. Sometimes these failures are accompanied by disconnects of /dev/ttyUSB* devices. In this case I'm also getting "port N disabled by hub (EMI?), re-enabling..." and then "unable to enumerate USB device" errors in syslog. I must admit that the whole setup is placed in a harsh environment with a lot of electromagnetic interference around. But some months ago, when I was just testing this kind of setup, everything worked fine and stable and I was able to acquire all the data I needed. I've already tried switching cables, hubs, the OrangePi board, and PSUs, but with no success; it is the same scenario every time. I've run out of ideas on how to fix this. What can I do to get more debug data? Why do bcmxcp devices keep disconnecting?
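One thing that may help while collecting debug data is knowing which physical hub port each /dev/ttyUSB* node belongs to, since the numbers can change after a disconnect. On many Linux systems udev creates stable symlinks under /dev/serial/by-path, but whether that directory exists on this board is an assumption. The sketch below just lists the links and their current targets. The function name list_serial_links is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print "stable-link -> current device node" pairs.
# $1 = directory with symlinks (defaults to /dev/serial/by-path, if udev provides it)
list_serial_links() {
    dir=${1:-/dev/serial/by-path}
    for link in "$dir"/*; do
        # Skip anything that is not a symlink (also covers an empty directory).
        [ -L "$link" ] || continue
        printf '%s -> %s\n' "$link" "$(readlink -f "$link")"
    done
}
```

If the links are present, a by-path name could be used as the port value in ups.conf instead of /dev/ttyUSB6, so that a re-enumerated adapter keeps the same name and a failing UPS can be tied to a specific hub port.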