is equal to False Rejection Rate, therefore the lower,
the better. Values range from 1.6% with the lighter
shoes to 5% with the heavier ones.
The work presented in (Gafurov et al., 2006) pro-
poses two different approaches for gait recognition,
tested on a in-house dataset of 21 users with only one
single walk each, further divided into two parts. This
causes a lack of most intraclass variation factors, and
therefore the dataset hardly represents a realistic sce-
nario. In real life settings, even the position of the
sensors cannot be completely controlled and is not
identical, and this fact in itself can produce variations
in the captured signal. For both proposed methods,
the 3-dimensional raw signals from the accelerometer
are combined into a single 1-dimensional vector using
the following ad hoc formula: v
i
= arcsin(
z
i
√
x
2
i
+y
2
i
+z
2
i
)
where i represents the index of the sample within the
signal. The first matching strategy exploits HS. To
this aim, the obtained values are stored in a histogram
representing the derived biometric template. Match-
ing between the obtained histograms achieves a 5% of
EER. The second attempt uses the 1D vector for cycle
comparison and achieves a 9% of EER.
In (De Marsico and Mecca, 2016), the authors
present a novel step segmentation procedure and show
the performance of five different algorithms based on
DTW, one dealing with the entire signal and the oth-
ers using different strategies to match the detected
steps. In this case no fixed threshold is used for seg-
mentation. The algorithm exploiting the entire signal
achieves 92.8% of Recognition Rate (RR) in closed
set identification modality and 9.26% of EER in ver-
ification modality; the algorithms exploiting the step
segmentation procedure achieve up to 82.7% of RR
and up to 10.3% of EER. It is to underline that results
in this latter work are obtained over a very large pub-
lic dataset including 175 subjects with 12 walks each
(Zhang et al., 2015).
The approaches in the second group, instead, try
to exploit machine learning algorithms/classifiers in
order to get the correct match between templates.
These proposals generally work only in verification
modality, training a classifier for each subject. A com-
mon pre-processing step is to fragment the signal in
chunks with a fixed length (in terms of either time or
number of samples, with or without overlap) in order
to extract more data for the training phase. Of course,
these approaches do not require any step/cycle seg-
mentation procedure. After the training of the clas-
sifier, that can occur on a more powerful device, the
trained system can be executed directly on a smart-
phone due to the low computational cost of the sin-
gle recognition operation. In fact, several works in
this category are generally designed to be executed di-
rectly on the mobile device in order to unlock it only
for its owner, as an alternative to pins or passwords.
Two solutions in this category are presented in
(Nickel et al., 2011b) and in (Nickel et al., 2011a).
Both works exploit the same dataset, collected by a
Google G1 phone. Such dataset contains walk signals
from 48 subjects, 4 walks each. In the first work, sig-
nals are re-sampled to 200Hz. Afterwards, they are
divided into fragments of 3 seconds with no overlap.
Fragments are then grouped into two sets, one for the
training and the other one for testing. The recognition
is carried out by the HVITE tool and each subject is
used one time as genuine and forty-seven times as im-
postor. This strategy reports an EER of about 10%.
In the second work, the walking signals are inter-
polated to 100Hz and then are fragmented into chunks
of 7 seconds with an overlap of 50%. Each fragment
is used as feature vector, adding some extra statistical
parameters, and the Mel and the Bark frequency cep-
stral coefficients. Training and recognition are car-
ried out exploiting the SVM classifier. This approach
achieves a 5.9% of False Match Rate (FMR) with
6.3% of False Non Match Rate (FNMR).
In (Nickel et al., 2012), the authors try to exploit
the k-NN algorithm for recognition. Walking data are
collected during two sessions. Each of the 36 users
is asked to walk 12 times at normal pace, 16 times
at fast pace and again other 12 times at normal pace
on a flat hallway. Each such group of walk signals is
captured by a single recording operation. Single sig-
nals are divided using an automatic procedure accord-
ing to some stop periods decided in advance, and the
result is eventually manually corrected. All signals
are then interpolated at 127Hz (this value is empiri-
cally chosen). After this preprocessing, the interpo-
lated signals are fragmented. This work exploits three
different fragment sizes, namely 3s, 5s, and 7.5s. In
all three cases, the fragments have an overlap of 50%.
The feature vectors are then created using some sta-
tistical parameters and by Mel and Berk coefficients.
The recognition exploits the implementation of k-NN
algorithm included in the WEKA library. The work
reports a FMR of about 4% with a very high FNMR
of about 22-23%, resulting in a Half-Total Error Rate
(HTER - the average of the two) of about 13%. In or-
der to improve performances, the authors try a voting
approach using different fragments of the same sub-
ject; this significantly reduces the FNMR while in-
creasing minimally the FMR, so achieving an HTER
of about 8.5%.
The proposal in (Zhang et al., 2015) presents a
large dataset of 175 subjects with 12 walks per per-
son (the same used as benchmark by our proposal)
and tests the extraction and the use of signature points
ICPRAM 2018 - 7th International Conference on Pattern Recognition Applications and Methods
632