detectors can be added into the system whenever more
information is required to make final decision.
The local decisions u
n
, n = 1,2,3 can be expressed
as
u
n
=
(
0, thenth detector favoursH
0
1, thenth detector favoursH
1
(4)
where ”favours” should be interpreted as the distance
between actual sample covariance estimate and refer-
ence covariance estimate is smaller than the empiri-
cal threshold for a particular false alarm rate. We use
P(H
0
) to denote prior probability that the seizures are
not present in a particular signal segment. A com-
mon assumption used here is the local observations
y
n
are conditionally independent, given the unknown
hypothesis H
i
.
After receiving the local decisions, the fusion cen-
tre makes the global decision by applying an optimal
fusion rule in order to minimize the final error prob-
ability. For a binary hypothesis testing problem, the
error probability P
e
is given by
P
e
= P(H
0
)P(u
0
= 1|H
0
) + P(H
1
)P(u
0
= 0|H
1
) (5)
The authors provided the optimality criterion for N
local detectors in the sense of minimum error prob-
ability in (Varshney, 1986). We recall it here for the
case of N = 3.
u
0
=
(
1, if w
0
+
∑
3
n=1
w
n
> 0
0, otherwise
(6)
where, w
0
= log
P
1
P
0
(7)
and w
n
=
(
log((1−P
m
n
)/P
f
n
), if u
n
= 1
log(P
m
n
/(1−P
f
n
)), if u
n
= 0
(8)
The probabilities of false alarm and missed detec-
tion of the nth local detector are denoted as P
f
n
and
P
m
n
, respectively. The optimal fusion rule tells us that
the global decision u
0
is determined by the a priori
probability and the detector performances, i.e., P
1
, P
f
n
and P
m
n
. However, they are all unknown in our seizure
detection problem, which is usually the case in many
other real applications (Mirjalily, 2003),(Liu et al.,
2007). In order to make the final decision, we need
to utilize the information available to us: the local bi-
nary decisions u
n
.
Suppose the decision combination {u
1
= i, u
2
=
j and u
3
= k} is represented by ℓ = (ijk)
2
, where
i, j,k = 0 or 1 (Mirjalily, 2003). In our system, the
number of all the possible local decision combina-
tions is 2
3
and will be denoted as L in the remainder of
this paper. The joint probability of decision {u
1
= i,
u
2
= j and u
3
= k} is also the occurrence probability
of the ℓth decision combination, given by
P
ℓ
= Pr(u
1
= i, u
2
= j, u
3
= k)
= P(u
1
= i|H
1
)P(u
2
= j|H
1
)P(u
3
= k|H
1
)P
1
+P(u
1
= i|H
0
)P(u
2
= j|H
0
)P(u
3
= k|H
0
)(1−P
1
)
(9)
P(u
n
= i|H
1
) =
(
1−P
m
n
, if i = 1
P
m
n
, if i = 0
(10)
P(u
n
= i|H
0
) =
(
P
f
n
, if i = 1
1−P
f
n
, if i = 0
(11)
In this nonlinear system, only seven out of eight
equations are independentsince
∑
P
ℓ
= 1 and there are
seven unknowns P
1
, P
f
n
and P
m
n
, for n = 1,2,3. Thus,
it can be solved when P
ℓ
are known. Although P
ℓ
is
usually unavailable in practice, it could be replaced
by empirical probability defined as
P
ℓ
= Pr(u
1
= i,u
2
= j,u
3
= k)
≃
number ofu
1
= i,u
2
= j,u
3
= k
number of local decisionsN
t
(12)
where N
t
is the number of decisions made by one
of the local detectors. The analytical solution to
the above nonlinear equations is given in (Mirjalily,
2003).
Note that in a particular setting if the data size is
limited and/or the number of events needed for accu-
rate calculation of anomalies is not sufficient we de-
veloped a maximum likelihood based algorithm that
exploits the multinomial probability mass function
describing the decision vector and utilized in order to
estimate the anomalies as well as prior probabilities
(seizure and no-seizure). We presented the details of
these algorithms in (Liu et al., 2014).
3 RESULTS
We evaluate the performance of the proposed algo-
rithms on the data set consisting of preterm infants
(GA less than 32 weeks) admitted to the Neonatal In-
tensive Care Unit at McMaster Hospital. Due to phys-
ical limitations we were able to obtain prior expect
knowledge on a very limited time length and hence
all of the non-seizure epoch were shorter than 400
samples using single C3 channel with minimal mo-
tion artefacts.
For illustrational purposes in Figures 2-4 , we il-
lustrate the detection performance as a scatter dia-
gram of windows selected from testing data. Note that
in the presence of motion artifacts the actual perfor-
mance will actually vary significantly. Furthermore