Closed paliasgh closed 1 month ago
Hello @AntonioConsilvio @sgiraz @pattacini , this is a gentle reminder about this issue. It is now happening almost every time we work with the robot, unfortunately. Thank you very much in advance.
Hello again @AntonioConsilvio @sgiraz @pattacini, I just wanted to again kindly remind you about our issue. Thanks a lot for your help.
Hi @paliasgh, during this period we had to focus on completing tasks with tighter deadlines. However, we have not forgotten about this issue and in the coming period we intend to plan the support service!
Hi @paliasgh, sorry for the late reply!
From the information unfortunately I am not yet able to provide a sure solution, so I need some information, but I can also give some tips that might be helpful:
Looking at the nature of the problem, I can think of two probable causes:
1) The problem could be the Ethernet connection.
Since the first board to complain about this problem seems to be the MC4-PLUS eb25 (10.0.1.25
), I would check the ETH connection between the EB24 board and the EB25 board (cable name E24).
2) The problem could be the power supply of the EB25 board (10.0.1.25). If the board has a damaged power supply cable, this may cause the board to suddenly switch off and thus cause the ETH line to drop out.
To check the status of the eb25 board, you should remove the upperarm cover (if you don't know how, I can provide instructions) and find 3 MC4-PLUS boards (specifically eb24, eb25 and eb26). The ETH cable (E24) will be a very short wire connecting the EB24 (the one closest to the mechanics) and EB25 (the middle one):
instead, to check the power supply you should check the yellow and black cable connected to the EB25 board:
I advise you to check the status of these two cables, and check if at the time the problem occurs, the eb25 board reboots or remains switched on.
I am not clear what physically happens to the robot when the error appears. Does the robot remain frozen and can no longer move from yarpmotorgui
? And if so, does the whole robot remain stuck (including the torso and legs)?
Does this problem only occur when the Cartesian is active?
When the problem happens, try starting FirmwareUpdater
on the robot head (entering with -X as it is a graphical application) and run the discover
procedure of the ETH boards to see the boards` ethernet communication at that time.
Please, send us feedback and if you have any questions feel free to ask!
Hi @AntonioConsilvio
Let me add some insights regarding
-I am not clear what physically happens to the robot when the error appears. Does the robot remain frozen and can no longer move from yarpmotorgui? And if so, does the whole robot remain stuck (including the torso and legs)?
-Does this problem only occur when the Cartesian is active?
The long comm timeout occurring on the YARP network of the right arm, during which we don't have fresh updates of the encoders, causes the Cartesian solver to pause. This is a safety countermeasure implemented by the SW as the system is clearly unstable. At that point, the solver can receive a message to restart its operations but the timeout will occur again.
Other pieces of SW (e.g., the yarpmotorgui
) may not rely on the same safety protection and can stay operational.
Hello @AntonioConsilvio and @pattacini , thanks you for your help!
I opened the left upperarm cover to see the boards. I am currently running things with the robot to see what happens when the Cartesian stops, however, it seems like it is not going to fail at all when the cover is not there! It has been about one hour that the robot is functioning without any problem. In this case, is it most likely a problem with cables when they are compressed? I then need to check them more carefully.
-- UPDATE: I tried pressing/moving those cables with a narrow plastic stick to see which one causes the problem. It seems like the problem is cable E25. It is the one that connects eb25 to eb26. As it is not a power cable, when the failure happens (errors appears in the log) the boards remain on (actually no light or etc. changes in the boards). Looking at E25, especially the connectors, things look visually okay, but not too great. Is there any other way we can verify that the issue is E25, not E24, before going forward with replacing the cable? Or maybe replacing both E24 and E25 is the best? Any advise? Thanks!
Thanks for the insight @pattacini!
Hi @paliasgh! I considered the damage to the E24 cable more probable, however I would not exclude the possibility, that the E25 could also cause the problem.
There is no problem with double replacement (both E24 and E25).
However, if you want to clarify which of the two cables is giving problems there are some tests that you can do:
1) Check the ETH LED on the board (when the error appears). In fact, the MC4-PLUS has a led behind the ETH connectors, which when switched on confirms correct ETH communication:
![IMG_20240422_141511](https://github.com/robotology/icub-tech-support/assets/114915464/696c5c66-4ce4-4e6e-9c32-92da1e307417)
2) Test the continuity of the cable with the multimeter.
3) You can try testing the crimping of the cable with tweezers, as in this video:
https://github.com/robotology/icub-tech-support/assets/114915464/801adf61-1312-4a0f-99fa-3175fac4bf6a
You can try pulling the cable with tweezers, without tearing it, just to see if the crimp still holds.
Please, send us feedback and if you have any questions feel free to ask!
Hello @AntonioConsilvio @pattacini, thanks a lot for all your support!
Last week, we replaced both E24 and E25 with pre-assembled cables (we had to reverse the order of wires in one head). The robot seems to be working without any major issue, with the arm covers all attached, since then! The Cartesian interface only failed 3 times during more than 50 hours of work. It maybe related to other things, I could not check the log since it happened very rarely.
Checking the original cables, I find out the issue was actually with the red wire of E25 cable (the video below):
https://github.com/robotology/icub-tech-support/assets/59978262/ddd0c937-dad6-48d9-aee5-7491c1a9d419
Again, many thanks for your help!
Robot Name 🤖
iCubWaterloo01 S/N:044
Request/Failure description
Hello,
As suggested, I am moving the discussion about the Cartesian solver stop working to here.
As origninally mentioned, often the robot randomly freezes and we need to restart Cartesian interfaces. Errors like this appear iKinCartesianSolver of left_arm:
At the same time, yarprobotinterface starts publishing these messages/errors:
When I try to stop the modules in startup application,
iKinCartesianSolver
ofright_arm
andyarprobotinterface
both fail to close properly and I will have to kill them.Thanks for all the assistance.
Detailed context
Here is a new, complete log file of startup application:
yarprunlog_29_02_2024_16_15_42.log
I ran everything on
icub-head
computer to see if the problem is in the communication betweenlocal-host
andicub-head
. The problem was still there. The issue only happens in the left arm. Here is also how the system status look like. There was a sudden drop in the network usage right at the same time when the issue happened:Additional context
No response
How does it affect you?
No response