Closed sincerywaing closed 7 years ago
Hi @sincerywaing - this looks very familiar to me. Please let me know: are you running this on a MX80? Is 12.3X48-D40.5
the real version you're working with?
@mirceaulinic yes the verion is correct and I'm using srx240.
Also if no-resolve can be added to the command, the efficiency could be greatly improved.
Are you saying that if you do no-resolve
you are not seeing the same problems?
I suggest you increase the timeout
optional arg to, say 120, and see what happens.
@mirceaulinic adding no-resolve is just a guess, 'cause this would make things much quicker. regardless this issue, I'd suggest we add this as an option. I'll try timeout and get back to you.
adding no-resolve is just a guess,
Okay, there was a bug on MX80 routers (apparently, they don't use the same traceroute binary as MX240, MX480 or MX960) -- not sure it's the same problem here, but the symptom sounds pretty similar:
On MX80 series platform, if executing traceroutes by NETCONF which destination is unresponsive, the processes related will run forever and need to be killed from the shell, otherwise the CPU consumption might go to 100%.
This has been solved in: 14.2R8 15.1X53-D60 15.1R5 15.1F7 16.1R2 16.2R1.
Would you have other physical platforms to compare and check if that's the case you're facing, or it just takes very long to respond?
@mirceaulinic I'll try srx550. meanwhile can you help evaluate the possibility to add no-resolve as kw?
also add to your comment, I'm tracing a destination that is responsive... that's why weird.
meanwhile can you help evaluate the possibility to add no-resolve as kw?
At the moment it's going to be very painful to add this kwarg. But feel free to define it as an additional optional_arg
.
@mirceaulinic confirmed it works fine on srx550. I'll submit a pr to add no-resolve as option_arg meanwhile I tried timeout=120 and it seems work fine. Thanks @mirceaulinic !
As commented in the PR, I don't think we should fix it with an optional_arg. We should either hardcode it to no-resolve or have an argument on the function itself.
For the looks of it, it's not going to solve this issue anyway.
We should either hardcode it to no-resolve
Please check the output first:
In case of success, the keys of the dictionary represent the hop ID, while values are dictionaries containing the probes results:
rtt (float) ip_address (str) host_name (str)
If we hardcode for everyone with no-resolve, ip_address and host_name will be the same for everyone, by default. Bad idea.
closing this out per discussion in pr mentioned.
Description of Issue/Question
when doing a traceroute, program crashes with ncclient timed out while waiting for an rpc reply Also if no-resolve can be added to the command, the efficiency could be greatly improved.
Did you follow the steps from https://github.com/napalm-automation/napalm#faq
[ *] Yes [ ] No
Setup
napalm-junos version
(Paste verbatim output from
pip freeze | grep napalm-junos
between quotes below)JunOS version
(Paste verbatim output from
show version and haiku
between quotes below)Steps to Reproduce the Issue
run a traceroute -
Error Traceback
(Paste the complete traceback of the exception between quotes below)