We are running NAV 3.13.0 since 08.02.13
Some days ago something happened in our environment that caused eventengine to
throw an error:
2013-02-13 17:08:57,887 [ERROR nav.eventengine.engine] Unhandled exception in
plugin <nav.eventengine.plugins.boxstate.BoxStateHandler object at 0x804e170d0>
; ignoring it
Traceback (most recent call last):
File "/usr/local/nav/lib/python/nav/eventengine/engine.py", line 190, in
handle_event
handler.handle()
File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py",
line 52, in handle
return self._handle_start()
File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py",
line 68, in _handle_start
self._set_internal_state_down()
File "/usr/local/nav/lib/python/nav/eventengine/plugins/boxstate.py", line
29, in _set_internal_state_down
shadow = self._verify_shadow()
File "/usr/local/nav/lib/python/nav/eventengine/plugins/delayedstate.py",
line 189, in _verify_shadow
netbox.up = (Netbox.UP_DOWN if netbox_appears_reachable(netbox)
File "/usr/local/nav/lib/python/nav/eventengine/topology.py", line 32, in
netbox_appears_reachable
target_path = get_path_to_netbox(netbox)
File "/usr/local/nav/lib/python/nav/eventengine/topology.py", line 82, in
get_path_to_netbox
path = networkx.shortest_path(graph, netbox, router)
File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/
algorithms/shortest_paths/generic.py", line 124, in shortest_path
paths=nx.bidirectional_shortest_path(G,source,target)
File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/
algorithms/shortest_paths/unweighted.py", line 138, in
bidirectional_shortest_path
results=_bidirectional_pred_succ(G,source,target)
File "/usr/local/lib/python2.7/site-packages/networkx-1.6-py2.7.egg/networkx/
algorithms/shortest_paths/unweighted.py", line 205, in _bidirectional_pred_succ
raise nx.NetworkXNoPath("No path between %s and %s." % (source, target))
NetworkXNoPath: No path between <server> and <router>
This happened for 5 servers all on the same GSW.
Four days later 4 of them are still marked as down even though ipdevinfo
reports availability numbers like the following and the servers clearly are up:
Availability 100.00% last day, 99.77% last week, 99.94% last month
Where do I have to push (or kick) to make NAV recognise the servers as up?
--Ingeborg
--
Ingeborg Østrem Hellemo -- ingeborg.hellemo(a)uit.no
Dep. of Information Technology --- Univ. of Tromsø