We are running NAV 3.13.0 since 08.02.13
Some days ago something happened in our environment that caused eventengine to throw an error:
2013-02-13 17:08:57,887 [ERROR nav.eventengine.engine] Unhandled exception in plugin <nav.eventengine.plugins.boxstate.BoxStateHandler object at 0x804e170d0> ; ignoring it Traceback (most recent call last): File "/usr/local/nav/lib/python/nav/eventengine/engine.py", line 190, in handle_event handler.handle() File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py", line 52, in handle return self._handle_start() File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py", line 68, in _handle_start self._set_internal_state_down() File "/usr/local/nav/lib/python/nav/eventengine/plugins/boxstate.py", line 29, in _set_internal_state_down shadow = self._verify_shadow() File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py", line 189, in _verify_shadow netbox.up = (Netbox.UP_DOWN if netbox_appears_reachable(netbox) File "/usr/local/nav/lib/python/nav/eventengine/topology.py", line 32, in netbox_appears_reachable target_path = get_path_to_netbox(netbox) File "/usr/local/nav/lib/python/nav/eventengine/topology.py", line 82, in get_path_to_netbox path = networkx.shortest_path(graph, netbox, router) File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/ algorithms/shortest_paths/generic.py", line 124, in shortest_path paths=nx.bidirectional_shortest_path(G,source,target) File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/ algorithms/shortest_paths/unweighted.py", line 138, in bidirectional_shortest_path results=_bidirectional_pred_succ(G,source,target) File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/ algorithms/shortest_paths/unweighted.py", line 205, in _bidirectional_pred_succ raise nx.NetworkXNoPath("No path between %s and %s." % (source, target)) NetworkXNoPath: No path between <server> and <router>
This happened for 5 servers all on the same GSW.
Four days later 4 of them are still marked as down even though ipdevinfo reports availability numbers like the following and the servers clearly are up:
Availability 100.00% last day, 99.77% last week, 99.94% last month
Where do I have to push (or kick) to make NAV recognise the servers as up?
--Ingeborg