but because many managers announce tasks simulta-
neously in a busy MMAS, the managers may have to
wait a long time to receive a sufficient number of bids:
This significantly reduce the performance of the entire
system. In the original conception of CNP (Smith,
1980), the use of multiple bids was proposed to con-
currently handle many announcements. If a contrac-
tor is awarded multiple bids simultaneously, however,
it may not be able to provide the quality or perfor-
mance guaranteed in the bids. In fact, more highly
capable contractor agents tend to be selected by many
managers. Additionally, the task structure, meaning
a task consisting of a number of different subtasks,
makes this situation more complex.
In this paper, we propose the award strategy,
called the adaptive probabilistic award strategy, to
improve the overall performance of MMAS. The first
key idea of the proposed strategy is the probabilis-
tic selection of awardee according to the task loads
of the system. However, the task loads of the system
are hardly given to each agent. Thus the second idea
is that manager agents estimate the task loads using
statistical data (more precisely, the difference in the
standard deviation – SD) of bid values for different
subtasks, where we assume that a task consists of a
number of subtasks that have different costs.
This paper is organized as follows. First, we will
discuss the model of CNP, which has been slightly
modified for MMASs to avoid long waits and to re-
duce the number of messages, the simulation environ-
ment and the issues addressed in this paper. Then, we
clarify how some degree of fluctuation can improve
the overall performance even if tasks have structures
and, by taking advantage of this effect, we propose
the adaptive probabilistic award strategy. Finally, we
experimentally show how our proposed method can
significantly improve overall performance.
2 MODEL AND ISSUES
2.1 Model of CNP for Massively MASs
Let A = {1,...,n} be a set of agents, T be a task,
and F = { f
1
,.. ., f
d
} be the set of skills, or func-
tions that agents can perform. We assume that task
T consists of subtasks, t
1
,.. .,t
l
, (therefore, we de-
note T = {t
1
,.. .,t
l
}) and that subtask t(∈ T ) requires
s(t)-th skill, f
s(t)
, to perform it, where 1 ≤ s(t) ≤ d.
A subtask is denoted by lower-case letter t and is
simply called a task unless this creates confusion.
Agent i is expressed as a tuple, (α
i
,L
i
,S
i
,Q
i
), where
α
i
= (a
1
i
,.. ., a
d
i
) is the set of the agent’s capabilities
(a
l
i
corresponds to the l-th skill, f
l
, and a
l
i
≥ 0; a
l
i
= 0
indicates agent i does not have skill f
l
), L
i
is the loca-
tion of i, and Q
i
is the queue where the agent’s tasks
are stored, waiting to be executed one by one. Set
S
i
(⊂ A) is i’s scope, i.e., the set of agents that i knows.
The metric between agents, δ(i, j), is based on their
locations, L
i
and L
j
, and is used to define the commu-
nication time (or delay) of messages between i and
j.
Subtask t has an associated cost, γ(t), which is the
cost to complete it. Subtask t can be done by i in
dγ(t)/a
s(t)
i
e unit times, where dxe denotes the ceiling
function. This time is also called the execution time
of t by i. Task T is completed when all its subtasks
are completed. The cost of T is defined as γ(T ) =
∑
t∈T
γ(t).
In every unit time, tl(≥ 0) tasks on average are
generated according to a Poisson distribution and ran-
domly assigned to different managers. Parameter tl is
called the task load and denotes tl tasks per unit time,
or simply tl T/t.
For CNP, we defined M = {m
j
}(⊂ A) as the set
of managers, who allocate tasks, and C = {c
k
}(⊂ A)
as the set of contractors, who execute the allocated
tasks. Let us assume that |A| is large (on the order of
thousands), therefore |M | and |C | are also large, and
that the agents are distributed widely, like servers on
the Internet.
2.2 Task Allocations for MMAS
In our experiments, we used the CNP modified for use
in an MMAS to reduce the number of messages and
to prevent long waits for a response. In this CNP, (1)
multiple bids and regret and no-bid messages are al-
lowed, and (2) manager m announces subtasks in T to
restricted contractors that are selected from its scope,
S
m
, on the basis of an announcement strategy. Regret
messages are sent in the award phase to contractors
who have not been awarded the contract; no-bid mes-
sages are sent to managers by contractors who have
decided not to bid on an announced task. These mes-
sages prevent long waits for bids and award messages
(e.g., (Sandholm, 1993; Xu and Weigand, 2001)).
When manager m receives task T , it immediately
initiates the modified CNP to allocate each task
˜
t(∈
T ) to an appropriate contractor agent. It first sends
announcement messages to the contractors selected
from its scope in accordance with the announcement
strategy. Each contractor receiving the announcement
sends back a bid message with a certain value called
the bid value. The bid values in general might include
parameters such as the price for executing the task, the
quality of the result, or a combination of these values.
Because we assume that agents are rational in terms
ICAART 2010 - 2nd International Conference on Agents and Artificial Intelligence
166