tems which perform actions to interact with an envi-
ronment. Extensive literature reviews of conventional
works to deal with the means to remain dependabil-
ity of autonomous agents are presented by Guiochet
(Guiochet, 2015). It is noted that not all the faults
from the system always can be analyzed and removed.
Fault tolerance therefore aims at continuing agent ser-
vices even with the presence of faults during the op-
erational stages of autonomous agents. Usually, fault
tolerance is implemented by using redundant agents,
i.e., once a failure is present, the backup agent is acti-
vated to replace the failed one. One limitation of most
conventional works is that there is a lack of investi-
gation on the analysis of fault tolerance mechanism
within the scope of system dependability. Obviously,
the evaluation on the fault tolerance ability of a sys-
tem could provide valuable information to improve
the system performance. This paper therefore pro-
poses a fault tolerance analysis for autonomousagents
within the context of agent-agent collaboration. By
the means of fault analysis, different methods such as
Petri net (PN), fault tree analysis (FTA), failure modes
effects and criticality analysis (FMECA), and hazard
operability (HAZOP) have been developed (Bernardi
et al., 2013). Yet, the PN framework has received a
lot of attention from research community due to its
wide applications for fault prevention in both devel-
opment and operational stages of an agent architec-
ture. Which also includes support for mitigation of
the implementation progress. With regards to those
advantages, an extended PN with colored time PN
(CTPN) for the analysis of fault tolerance mechanism
of collaborative agents has been chosen.
The rest of the paper is organized as follows. Sec-
tion 2 presents extensive literature reviews related to
this work. The analyses of fault tolerance in both cen-
tralized and decentralized approaches together with
PN background are described in Section 3. Exper-
imental results are illustrated in Section 4. Finally,
Section 5 concludes the paper with discussion of fu-
ture works.
2 RELATED WORKS
As aforementioned,the assessment of system depend-
ability is based on the basic attributes. Depending on
the specific applications, different attributes are used
to measure the dependability of a system. In the early
developments of software platforms, a multi-level
view of dependable computing was first developed by
Parhami (Parhami, 1994), in which most dependabil-
ity attributes were implemented. For robotics, with
regards to the safety to assess the dependability, an
intelligent home care robot to assist elderly people
was introduced by (Graf and Gele, 2001). The pro-
posed home care system was equipped with alterna-
tive levels of safety to prevent accidents caused by a
person being hit by the robot. The safety navigation
system consisted of user interface, path planning, and
obstacle avoidance with extensive sensors for motion
detection. In the work of (Mustapic et al., 2004), a
safe platform for industrial robotics has been devel-
oped. The authors have initialized the architectural
level of how to open a platform for quality constraints
and how to implement fault prevention. Although the
aspect of dependability was of major interest, these
papers did not address issues related to collaborative
robots.
For reliability analysis, PN has been applied as
an effective technique to model dependability (Mal-
hotra and Trivedi, 1995). Recently, reliability assess-
ment was introducedwith time PN and Markov chains
(Kohl´ık, 2009). The analysis of fault tolerance in
manufacturing systems by using PN was developed
by (Miyagi and Riascos, 2004). In their study, the
hierarchical and modular integration of PN was com-
bined to analyze production process, fault detection
process, and fault treatment process. Meanwhile, the
application of the generalized stochastic PN used on
the navigation of a single service agent was presented
by (Kim and Chung, 2007). The coordination of mul-
tiple controllers for agent navigation was then intro-
duced by (Moon and Chung, 2012). Similarly, PN
was used for the control of a group of robotic agents
(Joaquin et al., 2011).
In another aspect, the development of fault toler-
ance aims at increasing the reliability of a system. In
the work of (Troubitsyna and Javed, 2014), adaptive
fault tolerance was developed with regards to the sys-
tem dependability. A research on fault tolerance for
a group of agents in a cooperative environment was
described by (Haddad and Haddad, 2004). In their re-
search, the authors proposed a communication mech-
anism between the agents in a team to coordinate and
allocate the resources. The PN was used to illustrate
the model of the whole system. However, the research
is limited to a scheduling protocol for an agent team.
Close to our study, the fault tolerance analysis with
PN for a coordination of multi agents was developed
by (Acharya et al., 2014). However the approach pro-
posed by Acharya et al. has just initialized a picture
of how the system may look like. The study lacked
experimental setup for validation. Moreover, the non-
colored and non-hierarchy PN structure used in the
approach made the design complicated and unclear.
In this paper, the colored time PN enhanced with a
hierarchy structure is utilized to analyze the depend-