If we denote the Wilson's central terminal potential
as φ
WCT
and subtract it from both unipolar potentials
in equation (2), we get:
,
(3)
and finally:
.
(4)
Equation (4) is used to calculate a bipolar lead from
two MECG unipolar leads.
For 31-electrode MECG from Figure 1, 465
bipolar leads can be obtained. However, bipolar
lead's wireless hardware implementation tends to
become smaller and smaller. Hence, such devices
could benefit from a small inter-electrode distance,
which restricts the set of all useful bipolar leads
from MECG, to the set of bipolar leads formed only
from nearest neighbouring electrodes. In the case of
31-electrode MECG the useful set contains just 81
bipolar leads with 85320 possible combinations of
three bipolar leads.
Since the reduction in inter-electrode distance
inevitably reduces signal strength, three bipolar
leads for synthesis of 12-lead ECG may be selected
by means of evaluating signal strength from various
bipolar leads (Puurtinen et al., 2009).
2.3 Multivariate Linear Regression
To model the relationship between a 12-lead ECG
and a set of three approximation leads we used
MLR.
First, a MECG dataset is divided into two
approximately equal intervals. Chronologically first
interval is used by MLR algorithm to calculate
transformation coefficients, and the second interval,
not known to the MLR algorithm, is used for the
estimation of algorithm's efficiency.
Let a set of three arbitrary bipolar leads from the
first interval of the MECG be denoted by:
1,2,3.
(5)
The 12-lead ECG from the first interval of the
MECG is represented as a set of 12-leads:
12,,,,,,
1,2,3,4,5,6.
(6)
As it was already mentioned, every MECG
measurement contains enough leads to exactly
reproduce the standard 12-lead ECG, so ECG12 is
produced from X (see equation (1)) and will
represent a target ECG for our approximation.
Generally, linear regression model represents the
relationship between a response (i.e. criterion
variable) ECG12 and a predictor B (Tabachnik &
Fidell, 2001, chap. 5):
12
.
(7)
The response is modelled as a linear combination of
functions (not necessarily linear) of the predictor,
plus a random error ε. The expressions f
j
(B),
(j=1,…,p) are the terms of the model while the α
j
,
(j=1,…,p) are the coefficients. Errors ε are assumed
to be uncorrelated and distributed with mean 0 and
constant, but unknown, variance. Our problem can
be solved by the multivariate regression due to the
fact that the response variable ECG12 is
multidimensional, i.e. it is composed of 12 leads
(variables).
Given n independent observations (samples):
(B
1
, ECG12
1
),…,(B
n
, ECG12
n
) of the predictor B
and the response ECG12, the linear regression
model becomes an n-by-p system of equations:
12
12
…
…
·
, or
(8)
12·,
(9)
where M is the design matrix of the system. The
columns of M are the terms of the model evaluated
at the predictors. To fit the model to the input data,
the system must be solved for the p coefficient
values: [α
1
… α
p
], by applying the least-squares
solution, i.e. by minimizing the norm of the residual
vector: ECG12-M·α. We used MATLAB "regress"
function (The MathWorks, 2009) to solve the system
from equation (9).
The predictor B is multidimensional because it is
composed of three variables, so are the functions f
j
that form the terms of the model. For three
dimensional predictor B={B
1
, B
2
, B
3
}, terms for the
model might include f
1
(B) =B
1
(or for example
f
1
(B)=B
2
), which are linear terms, f
2
(B)=B
1
2
(quadratic terms), and f
3
(B)=B
1
·B
2
(a pairwise
interaction term). Typically, the function f(B)=1 is
included among f
j
, so that the design matrix M
contains a column of ones and the model contains a
constant term.
We have explored the usage of linear additive
(straight-line) models with terms f(B) = 1 and f(B) =
HEALTHINF 2010 - International Conference on Health Informatics
218