we model different social groups as formed agent-
coalitions of this multiagent system. Even though so-
cial groups evolve, as is their nature, with hierarchical
roles which increase in number over time, for simplic-
ity, our model simulates only two roles, the leader and
the members of the coalition.
As a leader, an agent considers an interesting
trade-off between the size and the joining-fee of its
coalition. On the one hand, the size of a coalition
influences the strength of the leader proportionally,
meaning that if the size of the coalition is expand-
ing, the power of the leader is gradually increasing.
On the other hand, members of the coalition utilize
their resources, such as money, effort, or ideas on be-
half of the coalition. To represent these dual influence
factors of the coalition, we introduce a joining-fee, as
shown in Equation 3. With a joining-fee, the agent
who is willing to join a coalition is forced make a
one-time payment defined by the leader, but a higher
joining-fee motivates agents to consider some other
coalition with a lower joining-fee, ultimately weaken-
ing the leader of the coalition on a higher joining-fee.
joining-fee
l
i
= X −ln(
|
coalition(l
i
)
|
) (3)
In Equation 3, the joining-fee of a coalition that
is headed by leader agent l
i
is defined as the differ-
ence between a predefined value X and the natural
logarithm of the coalition’s size. In our model, the
joining-fee decreases with the increase in the number
of members in stable coalitions, and when the society
reaches equilibrium, the difference in the fees to join
these coalitions is negligible.
To represent the dual influence stated above, BR
imposes a fee on the members of a coalition for the
leader as a portion of their utilities in each iteration. In
contrast, the individual agent, who is willing to join a
coalition, does not consider the fee to join a coalition.
Instead the agent rewards the best neighbor agent in
the coalition. This is determined by analyzing the
earnings of the neighbor agent, in terms of the util-
ities paid so far in the iteration. The more the agent
earns, the better a neighbor it is.
In our simulation, irrespective of the agent’s role,
each agent selects its best neighbor according to the
following preference: 1) the best neighboring leader
(
ˆ
l
i
) in terms of its coalition’s joining-fee and earned
utilities; 2) the best neighbor ( ˆn
i
) that is selected
by utilizing the learning model and 2 heuristics: a
higher average and a higher rise of utilities in past few
rounds
1
. In the case of selecting the best neighboring
agent (
ˆ
l
i
or ˆn
i
), the learning model is utilized by agent
1
In our model, the number of considered past interac-
tions is a predefined limit for all agents.
i if it does not deploy the 2 heuristics described above
due to inadequate neighborhood interactions. After
selecting the best neighboring agent, each agent’s be-
havior is categorized according to its role.
1. Independent Agent: Leader
ˆ
l
i
invites agent i to
join with the coalition. In our model, agent i ac-
cepts the invitation to join the coalition by paying
the coalition’s joining-fee. However, if the best
neighbor ˆn
i
of agent i does not belong to the coali-
tion led by leader
ˆ
l
i
, then the agent will remain in-
dependent, and not join the coalition at this junc-
ture. If in the future it’s best neighbor and the
majority neighbors
2
of agent i join the coalition
led by leader
ˆ
l
i
, then agent i may also join at this
later time. In an interaction between each neigh-
bor, agent i decides to cooperate or defect based
on the outcome of its learning strategy. The de-
cision of agent i is independent of its neighbors’
current activities, but it depends on previous ex-
periences and learning practices with neighbors.
2. Member of a Coalition: Agent i may decide to ac-
cept or reject an invitation to join a different coali-
tion of leader
ˆ
l
i
based on the coalition’s size as a
heuristic. When the heuristic is false (i.e., size of
agent i’s coalition is large), agent i not only re-
fuses the invitation but also defects with leader
ˆ
l
i
.
If the heuristic is true agent i accepts the invita-
tion and joins a new coalition. Agent i joins a
coalition proposed by best neighbor ˆn
i
, when the
heuristic is true
3
and the value from the learning
model is optimistic. The optimistic value from the
learning model has a higher probability of coop-
eration with neighbors, and so earns higher utili-
ties in the future. If best neighbor ˆn
i
is an inde-
pendent agent and has positive interactions in the
past (according to the learning model) then agent
i invites ˆn
i
to join with its coalition. Positive in-
teractions express the level of cooperation in the
past. As mentioned above, agent i always coop-
erates with other members of the same coalition,
and follows the decision from the learning model
for other neighbors.
3. Leader of a Coalition: After determining the
under-performance of its coalition members (i.e.,
their utilities are negative), leader l dissolve the
coalition and all its members become independent
agents. In the case of interacting with best neigh-
boring leader
ˆ
l
l
, leader l merges with the coali-
tion proposed by
ˆ
l
l
based on 2 heuristics: 1) if
the size of the proposed coalition is sufficiently
2
The threshold value of selecting the majority prefer-
ence is predefined in the simulation.
3
|
coalition (i)
|
<
|
coalition (ˆn
i
)
|
ICAART2015-InternationalConferenceonAgentsandArtificialIntelligence
26