The rest of the paper is organized as follows: in
section 2 the DE state of the art is described, with the
main focus on the SHA mechanism, section 3 presents
the modified Lehmer mean and theoretical insights,
section 4 contains the experimental setup and results,
as well as discussion, and section 5 concludes the pa-
per.
2 SUCCESS HISTORY BASED
DIFFERENTIAL EVOLUTION
The Differential Evolution was originally proposed
by K. Price and R. Storn in (Storn and Price, 1997)
for real-valued optimization problems solving. DE
is a population-based algorithm, in which the popu-
lation of NP individuals is represented as x
i, j
, where
i = 1, ..., NP, j = 1, ..., D, where D is the problem di-
mension. The goal is to minimize the function f(X)
with respect to bound constraints [xmin
j
, xmax
j
].
The DE starts by initializing the population ran-
domly within the boundaries, and proceeds by per-
forming mutation, crossover and selection (replace-
ment) operations. The mutation operator is the key
component of DE, which generates new mutant vec-
tor by combining the vectors from the population.
There are several mutation strategies known, how-
ever, in this study the current − to − pbest strategy,
introduced in JADE algorithm (Zhang and Sanderson,
2009), is applied:
v
j
= x
i, j
+ F ∗ (x
pb, j
− x
i, j
) + F ∗ (x
r1, j
− x
r2, j
) (1)
where F is the scaling factor, which is a parameter
usually in range [0, 1]. The pb index is chosen from
p% best individuals in the population, while r1 and r2
are chosen randomly from the population. Next, the
crossover operation is performed, in which the trial
vector is defined as:
u
j
=
(
v
j
, if rand(0, 1) < Cr or j = jrand
x
i, j
, otherwise
where Cr is the crossover rate in range [0,1],
jrand is set to random index in [1, D] and used to
make sure that at least one variable is taken from the
mutant vector. After this, the selection procedure is
applied, and the newly generated trial vector u re-
places the target vector x
i
if it has at least as good
fitness:
x
i, j
=
(
u
j
, if f(u
j
) ≤ f(x
i, j
)
x
i, j
, otherwise
The scaling factor F, crossover rate Cr, as well as
population size NP are three main control parameters
if DE. The SHADE algorithm (Tanabe and Fukunaga,
2013) improved the adaptation procedure in JADE
(Zhang and Sanderson, 2009) by introducing several
memory cells containing best known parameter val-
ues combinations, which were then used to generate
newtrial vectors. Initially there were H memory cells,
each containing a pair M
F
, M
C
r, which are set to 0.5,
and the current memory index h was set to 1. For each
mutation and crossover the F and Cr values were gen-
erated using Cachy distribution and normal distribu-
tion with scale parameter and standard deviation of
0.1 respectively:
(
F = randc(M
F,h
, 0.1),
Cr = randn(M
Cr,h
, 0.1)
If the newly generated F or Cr is outside the [0, 1]
interval, then it is generated again until it satisfies this
condition. During the selection step, if the trial vec-
tor was successful, i.e. better then target vector, the
values of F and Cr were stored in S
F
and S
C
r, as well
as the improvement value ∆f
j
= | f (u) − f(x
i
)|. After
the end of the generation, current h-th memory cell is
updated using the weighted Lehmer mean:
mean
wL
(S) =
∑
|S|
j=1
w
j
S
2
j
∑
|S|
j=1
w
j
S
j
(2)
where S is either S
F
or S
C
r, and the weight value
w
j
=
∆f
j
∑
|S|
k=1
∆f
k
. The index of memory cell h is incre-
mented every generation and set to 1 if h = H. The
idea of using several memory cells is to provide more
robust parameter adaptation, in which the fluctuations
of F and Cr would not influence the searh signifi-
cantly. The SHA mechanism is sensitive not only to
the improvement fact, but also to the value of the im-
provement, i.e. ∆f.
3 GENERALIZED LEHMER
MEAN FOR SUCCESS
HISTORY ADAPTATION
Originally, the usage of Lehmer mean was proposed
in JADE (Zhang and Sanderson, 2009) algorithm for
the calculation of F values only. In rJADE (Peng
et al., 2009), the weighted procedure for F calcula-
tion was proposed, where the ∆f
j
values were used.
Finally, in LSHADE (Tanabe and Fukunaga, 2014)
the Lehmer mean was used for both F and Cr calcu-
lation.