EFFECT OF CORRELATION BETWEEN CLINICAL TESTS ON

THE PERFORMANCE OF A MULTIPLE TEST-BASED

DIAGNOSTIC SYSTEM

Study with a Logistic Model and Neural Nets

Noriaki Ikeda

, Kai Ishida

, Harukazu Tsuruta

and Akihiro Takeuchi

Medical Informatics, School of AHS, Kitasato University, Sagamihara, Kanagawa Japan

Graduate School of Medical Sciences, Kitasato University, Sagamihara, Kanagawa Japan

Keywords:

Multiple tests, Diagnostic performance, Correlation between tests, Logistic model, Neural nets.

Abstract:

To examine the improvement of diagnostic performance by combining multiple tests, an algorithm was de-

veloped for generation of simulated data with arbitrary sensitivity, speciﬁcity and inter-test correlations. The

effects of the number of tests and inter-test correlations on the diagnostic performance were studied using a

logistic model and neural network (NN) models. The diagnostic performance measured by the concordance

index, c, increased as the number of tests increased. For the same number of tests, the diagnostic performance

was lowered by positive correlation and was elevated by negative correlation. Improvement of the performance

was not obtained by increasing the number of NN layers.

1 INTRODUCTION

It is a common practice in clinical medicine to de-

velop a better (more reliable) diagnostic system using

multiple tests that individually are less reliable (Ikeda

et al., 2006; Ikeda et al., 2007). For example, Hara et

al. reported that a better diagnostic index for predic-

tion of improvement of left ventricular ejection frac-

tion (LVEF) after cardiac resynchronization therapy

(CRT) in patients with heart failure could be obtained

using a combination of three indices of cardiac func-

tion, such as Radial, OWD and IVMD (Hara, 2008).

A logistic model is often used for combining mul-

tiple tests, each of which has a sensitivity and speci-

ﬁcity. The factor with a greater sensitivity and speci-

ﬁcity has a larger regression coefﬁcient. A neural net-

work (NN) model may be effectively used for a case

with strong nonlinearity.

If the tests are mutually independent the diagnos-

tic performance is expected to increase as the number

of combined tests becomes large. The ﬁrst problem is

to determine the relationship between the diagnostic

performance and the number of tests. However, there

are often correlations among tests. Improvement in

diagnosis is clearly not possible if these correlations

are strongly positive, whereas the effect of a negative

correlation is less clear. Therefore, the second prob-

lem is to determine the effect of inter-test correlations

on the diagnostic performance.

The purpose of the present study was to develop

an algorithm that calculates the probability of the

outcome of combined tests when the sensitivity and

speciﬁcity of each test and the inter-test correlations

are given, and to study the two problems described

above based on simulated data generated by the algo-

rithm.

In this study, we only deal with binary tests with

outcomes that are positive (1) or negative (0).

2 METHODS

2.1 Joint Probability of Two Tests

The relationship between disease D and a clinical test

can be presented as a contingency table (Table 1),

in which D reﬂects the status of the patient (D = 1

indicates having the disease and D = 0 indicates not

having the disease) and T

indicates the result of the

i-th test (positive T

=1, negative T

= 0).

The sensitivity and speciﬁcity of the test are rep-

resented by α

and β

, respectively. For D = 1, the

correlation coefﬁcient between test T

and test T

326

Ikeda N., Ishida K., Tsuruta H. and Takeuchi A..

EFFECT OF CORRELATION BETWEEN CLINICAL TESTS ON THE PERFORMANCE OF A MULTIPLE TEST-BASED DIAGNOSTIC SYSTEM - Study

with a Logistic Model and Neural Nets.

DOI: 10.5220/0003655703260329

In Proceedings of the International Conference on Neural Computation Theory and Applications (NCTA-2011), pages 326-329

ISBN: 978-989-8425-84-3

 2011 SCITEPRESS (Science and Technology Publications, Lda.)

Table 1: Contingency table of test Ti and diagnosis D.

D = 1 D = 0

= 1 α

1− β

= 0 1− α

; and for D = 0, the correlation coefﬁcient is r

−

For a case with D = 1, the joint probability

,(k,m = 0, 1) of T

and T

are given by:

= Pr(T

= 1,T

= 1) = α

+ φ

(1)

= Pr(T

= 1,T

= 0) = α

(1− α

) − φ

(2)

= Pr(T

= 0,T

= 1) = (1− α

)α

− φ

(3)

= Pr(T

= 0,T

= 0) = (1− α

)(1− α

) + φ

, (4)

where

= r

(1− α

)(1− α

). (5)

For a case with D = 0, similar relationships are

obtained by replacing α

by 1− β

Cα

by 1− β

by r

−

Cand φ

by φ

−

= Pr(T

= 1, T

= 1) = (1− β

)(1− β

) + φ

−

(6)

−

= Pr(T

= 1,T

= 0) = (1− β

)β

− φ

−

(7)

−

= Pr(T

= 0,T

= 1) = β

(1− β

) − φ

−

(8)

−

= Pr(T

= 0, T

= 0) = β

+ φ

−

(9)

where

−

= r

−

(1− β

)(1− β

) (10)

2.2 Data Generation Algorithm

A general theory of the distribution of n binary items

has been established (Bahadur, 1961).

Let X denote the set of all points x = (x

,...,x

)

with each x

= 0 or 1. Let p(x) be a given probability

distribution on X, i.e.,

p(x) ≥ 0,

∑

x∈X

p(x) = 1. (11)

For each i = 1,...,n, let

= E

), 0 < α

< 1, i = 1,...,n (12)

where E

denotes the expected value about p. If the

variables x

,...,x

are mutually independent, we

have

p(x) =

∏

i=1

(1− α

)

1−x

(13)

When there are correlation among the variables, Ba-

hadur gave the following theorem (Bahadur, 1961).

mTheoremn@For all x = (x

,...,x

) on X

p(x) =

∏

i=1

(1− α

)

1−x

f(x) (14)

with

f(x) = 1 +

∑

i< j

∑

i< j<k

ijk

+... + r

12···n

· · · y

(15)

= (x

− α

(1− α

) (16)

= E

) (17)

ijk

= E

) (18)

... (19)

12···n

= E

· · · y

), (20)

where r

is the second-order correlation, r

ijk

is the

third-order correlation, etc.

Similarly, if we set

1− β

= Pr(x

= 1|D = 0) (21)

= (x

− 1+ β

(1− β

), (22)

then the probability distribution q(x) for D = 0 is

given by

q(x) =

∏

i=1

1−x

(1− β

)

g(x) (23)

with

g(x) = 1+

∑

i< j

∑

i< j<k

ijk

+... + s

12···n

· · · z

(24)

= E

) (25)

ijk

= E

) (26)

... (27)

12···n

= E

· · · z

). (28)

With this theory, all probabilities of combination

of outcomes of tests with arbitrary sensitivity, speci-

ﬁcity and correlations among the tests can be com-

puted.

2.3 Example of Test Data

A data set of N tests with the following conditions was

generated by the method described in section 2.2. The

sensitivity and speciﬁcity of each test were both set to

0.6:

= β

= 0.6, i = 1,2,...,N. (29)

We deﬁne R

as the correlation matrix among

tests for the population with disease (D

), and R

−

that for the population with no disease (D

−

). Higher

order correlations (> 2) were set to zero in this study,

EFFECT OF CORRELATION BETWEEN CLINICAL TESTS ON THE PERFORMANCE OF A MULTIPLE

TEST-BASED DIAGNOSTIC SYSTEM - Study with a Logistic Model and Neural Nets

327

although they can easily be considered. An example

data set with N = 4 and the correlation matrix

= R

−







1.0 −0.3 0.0 0.0

−0.3 1.0 0.0 0.0

0.0 0.0 1.0 0.0

0.0 0.0 0.0 1.0







(30)

is shown in Table 2. The frequency of each outcome

of the tests was calculated according to p(x) and with

the number of cases of D

and D

−

set at 1000.

2.4 Diagnostic Systems

The following three models were examined as the di-

agnostic system.

(1) LG1: Logistic model.

(2) NN1: Neural net with a single layer.

(3) NN2: Neural net with two layers with 5 cells.

Table 2: Test data generated by the simulation.

Outcome of the tests Frequency

−

0 0 0 0 14 104

1 0 0 0 21 69

0 1 0 0 21 69

1 1 0 0 32 46

0 0 1 0 50 112

1 0 1 0 75 75

0 1 1 0 75 75

1 1 1 0 112 50

0 0 0 1 50 112

1 0 0 1 75 75

0 1 0 1 75 75

1 1 0 1 112 50

0 0 1 1 46 32

1 0 1 1 69 21

0 1 1 1 69 21

1 1 1 1 104 14

2.5 Evaluation of Diagnostic

Performance

As the indices of performance of the system, we cal-

culated the Somers’D (Gini coefﬁcient), Goodman-

Kruskal gamma, Kendall’s Tau-a, and the concor-

dance index, c, which are closely related to each other.

We chose to use the value of the concordance index

for each result, because this index is known to give the

area under the receiver operating characteristic (ROC)

curve of the diagnostic system.

2.6 Computation Methods

SAS 9.1.3 was used for logistic analysis and MAT-

LAB (Neural Net Toolbox) was used for the NN1 and

NN2 calculations.

3 RESULTS

For cases with N = 3 − 7, the sensitivity and speci-

ﬁcity were set to 0.6. For each case, computation was

performed under the following three conditions:

(a) Independent: R

= R

−

= I

(b) Positive correlation: R

(1,2) = R

−

(1,2) = 0.3

(1,2) = R

−

(1,2) = −0.3

3.1 Comparison of the Diagnostic

Systems

We did not ﬁnd any signiﬁcant differences among the

three diagnostic systems, LG1, NN1 and NN2. The

results from NN1 are shown in Table 3.

Table 3: Concordance index c.

N (a)Independent (b)Positive R (c)Negative R

3 0.683 0.665 0.697

4 0.710 0.693 0.714

4 0.737 0.720 0.740

4 0.758 0.745 0.761

4 0.759 0.757 0.781

3.2 Effect of the Number of Tests

The concordance index, c, increased as the number

of tests increased. The ROC curve for each case is

shown in Figure 1.

3.3 Effect of Correlation between Tests

As shown in Table 3, the diagnostic performance of

the combined tests was worse in a case of positive cor-

relation between tests and better in a case of negative

correlation, compared to the independent case.

4 CONCLUSIONS

Examination of the improvement of diagnostic per-

formance by combining multiple tests requires an al-

gorithm for generating simulated data with arbitrary

sensitivity, speciﬁcity and inter-test correlations.

NCTA 2011 - International Conference on Neural Computation Theory and Applications

328

Figure 1: ROC curve of the diagnostic system for different

number of tests, N. N=2 (blue), 3 (green), 4 (red), 5 (cyan),

6 (yellow) and 7 (black).

The effects of the number of tests and inter-test

correlations on the diagnostic performance were stud-

ied using a logistic model and neural network models.

The diagnostic performance measured by the con-

cordance index, c, increased as the number of tests

increased. For the same number of tests, the diagnos-

tic performance was reduced by positive correlation

and elevated by negative correlation. Improvement of

the performance was not obtained by increasing the

number of NN layers.

ACKNOWLEDGEMENTS

This study was funded in part by a grant from the

Kitasato University School of Allied Health Sciences

(No. 2010-6604).

REFERENCES

Bahadur, R. R. (1961). A representation of the joint dis-

tribution of responses to n dichotomous items. In

Solomon, H., editor, Studies in Item Analysis and Pre-

diction, pages Chapter 9:158–160. Stanford Univer-

sity Press.

Hara, H. (2008). A logistic analysis of left ventricular ejec-

tion fraction (LVEF) after CRT. In American Heart

Association 2008.

Ikeda, N., Bax, L., Henmi, O., Mamorita, N., Tsuruta, H.,

Shibata, S., and Takeuchi, A. (2007). Study of a lo-

gistic model with mutually correlated variables using

a generation algorithm of dichotomous data with arbi-

trary sensitivity, speciﬁcity and correlation. In MED-

INFO 2007. , Brisbane, Australia. (Proc 2486-2488).

Ikeda, N., Shibata, S., Bax, L., Henmi, O., Mamorita, N.,

Tsuruta, H., and Takeuchi, A. (2006). Diagnostic per-

formance of combined tests using a generation algo-

rithm of multiple tests with arbitrary sensitivity, speci-

ﬁcity and correlation. In MEDSIP 2006. Glasgow,

UK.

EFFECT OF CORRELATION BETWEEN CLINICAL TESTS ON THE PERFORMANCE OF A MULTIPLE

TEST-BASED DIAGNOSTIC SYSTEM - Study with a Logistic Model and Neural Nets

329