analysing the effects of changing F and NP on market
behaviour, we can gain insights into how these algo-
rithms influence market dynamics and how they can
be optimised to maximise profitability.
2 BACKGROUND
Gode and Sunder (Gode and Sunder, 1993) revo-
lutionised the field of experimental economics in
1993 with the advent of the Zero-Intelligence Con-
strained (ZIC) trader-agent. These agents, which gen-
erate quote prices uniformly at random from a pre-
defined range, were shown to reproduce surprisingly
human-like market dynamics. In their original pa-
per, Gode and Sunder defined the feasible range of
trading prices as [1, 200]. As such, a buyer with a
limit price of λ
B
would generate quote prices from
U(1, λ
B
), whilst a seller with a limit price of λ
S
would
generate quote prices from U(λ
S
, 200). Modern im-
plementations of the ZIC model do not rely on a priori
information about the feasible range, instead utilising
the lowest and highest values in the order book as the
bounds. Cliff and Bruten critiqued much of Gode and
Sunder’s work on ZIC in (Cliff and Bruten, 1997), but
the work that arose from the ZIC model is still widely
used in studying market dynamics. Many subsequent
zero-intelligence (ZI) and minimal-intelligence (MI)
trader-agents have stemmed from the work of Gode
and Sunder.
One such ZI trader-agent, Parameterised-
Response Zero-Intelligence (PRZI) (Cliff, 2021), was
recently introduced by Cliff. PRZI is a nonadaptive
generalisation of ZIC, in which the shape of the
probability mass function (PMF) used to generate
quote prices is governed by a strategy parameter
s ∈ [−1, 1] ∈ R. This s-value determines the degree
of ‘urgency’ or ‘relaxation’ in the trader’s behaviour.
As s → 1, the distribution is evermore biased towards
‘urgent’ quote prices—those closest to the least
profitable price for the trader but most likely to attract
a willing counterparty—conversely, as s → −1,
the distribution is biased towards ‘relaxed’ quote
prices—those that generate the most profit for the
trader but are considerably less likely to attract a
counterparty. When s = 0, the PMF is uniform,
identical to that of a ZIC trader.
PRZI with Stochastic Hill Climbing (PRSH)
(Cliff, 2022a) is an adaptive extension to PRZI, also
introduced by Cliff. The algorithm dynamically alters
the strategy parameter s in an attempt to increase prof-
itability. A given PRSH trader i maintains a private
local population S
i
of k strategy parameters, each of
which it evaluates via a loop to identify the most prof-
itable. The most profitable strategy is ‘mutated’ via a
stochastic mutation function to produce k −1 mutants,
and this set of k strategies constitutes the new S
i
.
PRZI with Differential Evolution (PRDE) (Cliff,
2022b) is Cliff’s latest adaptive extension to PRZI. It
replaces the simple stochastic hill climber in PRSH
with a DE optimisation system (Storn and Price,
1997). A given PRDE trader i maintains its own DE
system with a population of candidate s-values S
i
of
size NP ≥ 4 denoted by s
i,1
, s
i,2
, ..., s
i,NP
. Once trader i
has evaluated a particular strategy s
i,x
, three other dis-
tinct s-value are chosen at random: s
i,r
1
, s
i,r
2
and s
i,r
3
such that x 6= r
1
6= r
2
6= r
3
. A new candidate strategy
s
i,y
is constructed as follows:
s
i,y
← max(min(s
i,r
1
+ F
i
(s
i,r
2
− s
i,r
3
), 1), −1) (1)
where F
i
is trader i’s differential weight coefficient.
The fitness of s
i,y
is evaluated, and if it performs bet-
ter than s
i,x
, then s
i,y
replaces s
i,x
; otherwise, it is
discarded and the next randomly selected strategy is
evaluated. PRDE also includes a ‘mega-mutation’
mechanism to deal with convergence issues that arise
from s
i,r
2
−s
i,r
3
tending very close to zero in a highly-
converged population. Suppose at any time the stan-
dard deviation of the candidate s-values in trader i’s
private population is less than 0.0001. In that case, a
randomly selected candidate s
i,r
is given a value sam-
pled randomly from U(−1, 1).
3 HOMOGENEOUS
EXPLORATION OF F AND NP
3.1 Overview of the Combined Effect of
F and NP
According to Cliff’s research in (Cliff, 2022b), using
DE as the adaption mechanism for each trading entity
in a market can double the profit extracted through
traders’ interactions, compared to using a stochastic
hill climber as the adaption mechanism. However, in
his simulations, Cliff only experimented with F = 0.8
and NP = 4 for the PRDE traders. As such, the ini-
tial experiments in this paper serve as a follow-up: to
analyse the combined effect that different values of F
and NP have on profitability. I designed a set of ex-
periments with a similar setup as Cliff, in which BSE
was used to simulate a financial market for a single
abstract tradeable commodity. In each market simu-
lation, I implemented a homogeneous population of
N
T
= 30 PRDE traders with an equal number of buy-
ers N
B
and sellers N
S
(i.e. N
B
= N
S
= 15). The pop-
ulation in a given experiment was homogeneous with
ICAART 2023 - 15th International Conference on Agents and Artificial Intelligence
136