Handling an Overlay Node Failure When a node
is unreachable through both the paths i.e the direct
and the alternate path then it is assumed to be failed.
When a child C detects that it’s parent P has failed
then it sends its grand parent the node failure event
of P. The failure of node P can also be detected by
the parent of P. In both the cases the parent of the
failed node performs the Node
Leave() operation of P
on behalf of P.
4.2 Proof of Correctness
Lemma: (i) In a hierarchical network in which ev-
ery broker node has two node disjoint paths x and y
to its parent node as well as a path z to its parent’s
parent, such that it is node disjoint from x and y and
does not contain the parent node, every node remains
connected to the root node in the event of failure of a
single physical node or link.
(ii) Moreover, if a new broker is added to the hierar-
chical network by forming three physical paths, x’, y’
to its parent and z’ to its parent’s parent such that x’,
y’ and z’ are pairwise node disjoint and z’ does not
contain the parent of the new node, then the resultant
network is also tolerant to single node and link fail-
ures.
Proof: Consider any non root broker node b in the
hierarchical network. We show that it remains con-
nected to the root node r in the event of failure of a
single node s (s6=b) or single link l.
I. If Node s fails
Case 1: s is b’s parent. Node b still has a path z to
node s’s parent, which does not contain node s, and
the network being hierarchical, the path from parent
of s to r does not contain s.
Case 2: Node s is b’s parent’s parent. Node b has
paths x and y to its parent, which is connected to its
own parent’s parent via a path independent of s, and
as the network is hierarchical, all the way to the root.
Case 3: Node s is not a broker node. If node s does
not occur in the physical path from b to r then its fail-
ure cannot affect the connectivity of b to r. If it ex-
ists in the path, then let (b,p
1
, p
2
... p
n
, r), be the path
along parent nodes from b to r. For any overlay edge
p
i
, p
i+1
along this path, the failure of s does not affect
the connectivity of p
i
to p
i+1
, as there is an alternate
physical path which does not contain s.
II. If Link l fails
If link l does not occur in the physical path from b
to r then its failure cannot affect the connectivity of b
to r. If it exists in the path, then let (b, p
1
, p
2
... p
n
, r),
be the path along parent nodes from b to r. For any
overlay edge p
i
, p
i+1
along this path, the failure of l
does not affect the connectivity of p
i
to p
i+1
, as there
is an alternate physical path which does not contain
link l.
5 SIMULATIONS
Experimental Setup The experimental setup con-
sists of a simple network simulator for event based
middleware. The Simulator, developed in java, mod-
els all the basic network features like delay, band-
width and loss of data. The application data is con-
verted into simulation events and kept in a simu-
lation event queue and then processed according to
their attached time stamps. The time stamp is as-
signed according to the delay for data to get trans-
ferred from the source to destination in a real net-
work. The Simulator can generate performance data
like the data traffic, control traffic and the process-
ing load. The Simple Event Based System network
Simulator uses BRITE ( Boston university Represen-
tative Internet Topology gEnerator) (Alberto Medina
and Byers, 2001) to generate Internet topology. The
overlay topology is formed from the BRITE gener-
ated physical network topology by choosing the over-
lay nodes from the physical nodes. The delay and
bandwidth are calculated over the physical path that
represents the overlay link. An AS-level physical
topology of 10000 nodes, generated by BRITE using
Waxman generation model is used for overlay topol-
ogy construction. The bandwidths for the links are
uniformly distributed.
Table 2: Simulation Parameters.
Number of events 10000
Number of Event Clients 1000
Number of Event Brokers 100
Distribution of Clients Uniform
Average Message size 50 bytes
Failure distribution random
For experimentally verifying the advantages of
an underlay aware overlay topology we implemented
SIENA( Scalable Internet Event Notification Archi-
tecture)(A. Carzaniga, 2001) which has hierarchical
topology and extended it with underlay awareness
using the above discussed algorithms. The percent-
age of messages delivered to subscribers in the face
of increasing number of link and node failures is
monitored for underlay aware SIENA and unmodi-
fied SIENA. The redundant paths between parent and
child node in underlay aware Siena are used to reduce
link stress on heavily loaded links. The experimental
results of this load balancing are plotted for under-
lay aware SIENA from the simulated results. Table 2
shows the simulation parameters.
ADDING UNDERLAY AWARE FAULT TOLERANCE TO HIERARCHICAL EVENT BROKER NETWORKS
103