with a predefined confidence level, whether the
difference of the two means of the two sample
groups' performance, actually indicates that one
system has higher performance than the other, and
hence, eliminate the random factor in selecting the
samples.
Table 1: Terms used in the hypothesis testing procedure.
Term Definition
N
P
CEAS+
P
CEAS-
S
CEAS+
S
CEAS-
The number of interactions chosen as the test
period (number of test cases).
The mean performance of a sample of agents
using ITRM after their n
th
interaction.
The mean performance of a sample of agents
using no trust model after their n
th
interaction.
The standard deviation of the performance
sample of CEAS+.
The standard deviation of the performance
sample of CEAS-.
The result of carrying out the hypothesis testing
procedure for different test periods (i.e. 10, 20, 30,
40, 60, 80, and 100 interaction) is illustrated in
Table 2.
Table 2: Hypothesis testing results.
Number of
P
CEAS+
P
CEAS-
S
CEAS+
S
CEAS-
SE
DF
t
P-value
10 87.77 84.67 1.47 4.45 1.479 4.96 2.1 0.045
20 91.13 84.71 0.84 4.91 1.114 16.32 5.8 1.46E-5
30 92.92 82.74 0.65 5.83 1.071 26.07 9.5 3.02E-10
40 93.03 86.19 0.63 5.20 0.829 57.25 8.3 1.32E-11
60 93.02 86.26 0.71 4.55 0.595 297.5 11.4 2.16E-25
80 93.09 84.85 0.98 5.11 0.585 585 14.1 2.59E-39
100 92.99 85.78 0.68 5.08 0.513 732.86 14.1 3.32E-40
Since, the P-value for all cases is less than the
significance level (0.05), we cannot accept the null
hypothesis. Therefore, this table shows that the
corresponding hypothesis tests conclude that the
CEAS+ outperforms the CEAS- and that the
performance difference is statistically significant
(using the confidence level of 95%).
4 CONCLUSIONS
Previous work addressing trust, has investigated
active trust, but passive trust has not been explicitly
addressed within the field of multi-agent systems to
date. In active trust, the performance of individual
agents from various perspectives is evaluated using
various sources of trust information, such as, direct
interaction or through witness reports. But, in such
cases, the agent that is meant to be evaluated is
known in advance. In passive trust (addressed by
this work), the performance of an agent within a
group of agents that collaborate to achieve a shared
goal is what is being evaluated. To do so, a trust
ratio for each agent in the team is induced. The
presented model for achieving this task: Inducing the
Trust Ratio Model (ITRM) is thus a novel model for
trust evaluation that is specifically designed for
general application in multi-agent systems. In order
to verify the claim that this model is both effective
and useful, empirical evaluation was carried out.
Through this evaluation it was demonstrated that
agents using the trust model - ITRM - provided by
CEAS are able to select reliable partners for
interactions and, thus, obtain better utility gain
compared to those using no trust measure.
REFERENCES
Cohen, P., 1995. Empirical Methods for Artificial
Intelligence, The MIT Press.
Dong, H., Jennings, N., Shadbolt, N., 2006. An Integrated
Trust and Reputation Model for Open Multi-Agent
Systems. In Journal of Autonomous Agents and Multi-
Agent Systems.
Falcone, R., Barber, S., Korba, L., Singh, M., (editors)
2003. Trust, Reputation and Security: Theories and
Practice, Volume 2631 of Lecture Notes in Computer
Science. Springer-Verlag Berlin Heidelberg.
Falcone, R., Singh, M., Tan, Y., (editors) 2001. Trust in
Cyber-Societies: Integrating the Human and Artificial
Perspectives, Volume 2246 of Lecture Notes in
Computer Science. Springer-Verlag Berlin Heidelberg.
Ismael, O., El-Beltagy, S., Bahgat, R., Rafea, A., 2007.
Collaborative Knowledge Based System through
Multi-Agent Technology. In the 42
nd
Annual
Conference on Statistics, Computer Science, and
Operations Research. Institute of Statistical Studies
and Research, Cairo University.
Jensen, C., Poslad, S., Dimitrakos, T., (editors) 2004.
Trust Management, Volume 2995 of Lecture Notes in
Computer Science. Springer-Verlag Berlin Heidelberg.
Proceedings of the Second International Conference,
iTrust 2004 Oxford, UK, March 29 – April 1, 2004.
Rafea, A., Mahmoud, M., 2001. The Evaluation and
Impact of NEPER Wheat Expert System. Fourth
International Workshop on Artificial Intelligence in
Agriculture IFAC/CIGR, Budapest, Hungary.
Ramchurn, D., Dong, T., Jennings, R., 2004. Trust in
Multi-Agent Systems. The knowledge engineering
review, 19(1): 1-25, March 2004.
ICAART 2009 - International Conference on Agents and Artificial Intelligence
474