in which the patients are clustered according to BBS
improvement using statistically most significant pa-
rameters. It is often argued that proper administration
of intrahospital as well as post-recovery procedures
can significantly improve the recovery of patients. To
this purpose it would be extremely beneficial to prop-
erly triage (cluster) patients at the admission stage in
order to ensure optimal distribution of resources. We
then evaluate the proposed clustering procedure on a
data sample consisting of 203 patients that have been
admitted to the Institute for Rehabilitation, Belgrade,
Serbia.
The paper is organized as follows. In Section II we
describe the data set and the proposed classification
models. In Section III we evaluate the applicability of
the proposed algorithm using a real data set. Finally,
in Section IV we discuss the results and future work.
2 SIGNAL PROCESSING
MODELS
2.1 Data Set
The prospective study included 203 patients with hip
fractures that were referred into the specialized re-
habilitation institution for the rehabilitation program
and admitted from january 2011 until June 2011.
Prior to the inclusion into the study, eligible partici-
pants were informed about the study protocol and re-
habilitation program with possible contraindications
that might arise over the course of treatment. The
informed consent was obtained prior to the inclu-
sion. The study was approved by the Institutional
Review Board. Implementation of rehabilitation pro-
gram was individually prescribed with respect to the
patients functional status and continuously monitored
for early identification of possible complications that
could arise during the treatment. Functional status of
every individual in the program was evaluated by the
Berg Balance Scale test on 3 occasions: at the ad-
mission (Group 1), at discharge from the rehabilita-
tion facility (Group 2) and 3 months after discharge
(Group 3). Berg Balance Scale test evaluated 14 tasks
(5 static and 9 dynamic) that are graded as 5 points
scale with the range from 0 to 4, to the maximal value
for the summarized scores of 56 (Stevenson et al.,
2010). Ability to predict falls in elderly population
suggests the validity of BBS test (9). The BBS is
used to measure functional balance that is composed
of 3 dimensions: position maintaining, postural ad-
justment to voluntary movements and reaction to ex-
ternal disturbance (Berg et al., 1995).
2.2 Preprocessing
We organize the data set in a database consisting
of 203 rows corresponding to the patients and 33
columns of different features (age, height, weight,
respiratory conditions, heart conditions, BBS at the
admission, BBS at the discharge, BBS three months
after discharge, etc.) Then we analyze cross-
correlation between all the features and extract sta-
tistically significant ones using Pearson coefficient.
In order to study dynamics of rehabilitation we use
log-values of BBS score ratios. The rationale behind
this approach is that we expect exponential change in
balance improvement and thus log (semi-log) models
may represent better fit.
2.3 Clustering Algorithm
Once statistically significant features have been se-
lected the problem reduces to clustering of m-
dimensional vectors into a set of pre-determined clus-
ters. In order to determine appropriate use of clinical
resources as a preliminary approach we proposeto de-
termine which patients have largest/smallest capacity
for recovery. We propose to determine the significant
parameters using Spearman correlation coefficient
which is commonly used technique in cases/models
where nonlinearity is expected. We then propose to
cluster all the patients into several clusters. We inves-
tigate two possible scenarios in this paper: a) two-
cluster scenario consisting with high rate recovery
and low rate recovery patients, and b) three-cluster
scenario - low, medium, and high rates of recovery.
Note that the number of clusters can be arbitrarily set
and is usually controlled by the overall error of clas-
sification. In addition, the quality of health care and
resource management can be relatively robust to the
overall error of classification and thus the overall re-
sults in treatments may not change significantly for
small variations in number of clusters.
In order to cluster the data set we propose to use
fuzzy clustering based on Gath-Geva (Gath and Geva,
1989) clustering which uses Gaussian distance and
consequently assumes that the data set arises from
mixture of Gaussian distributions. A general outline
of the algorithms is as follows: a) arbitrarily assign
each patient to a cluster i.e. arbitrarily pick if the pa-
tient is high or low rate recovery. Note also that the
preliminaryclustering can be either done arbitrarily or
using a hard clustering algorithms such as K-means,
b) update cluster centers, c) reassign objects to the
clusters to which the objects are most similar, d) re-
peat until no change by reassignment. The update of
clustering matrix is done using the following approa-
ANALYSIS OF BERG BALANCE SCALE IN HIP FRACTURE PATIENTS USING FUZZY CLUSTERING
467