Table 5: Error rates using the characters A, B, C, D, P, I,
J, K, L... and using the time interval as the distance metric
(distance classes set automatically).
r F-accept G-reject FAR FRR ER
0.0 23/325 100/609 7.1% 16.4% 23.5%
0.5 33/325 59/609 10.2% 9.7% 19.8%
1.0 46/325 35/609 14.2% 5.7% 19.9%
1.5 64/325 26/609 19.7% 4.3% 24.0%
2.0 79/325 13/609 24.3% 2.1% 26.4%
larly the next higher distances were represented by K
and L. A pen-up character P was also included in the
string whenever the pen tip was not in contact with
the writing tablet. Pen-up time was divided into small
and large, where small pen-ups were represented by
a single P and large pen-ups represented as PP. The
limits for each of the four distance classes were mutu-
ally calculated using the distance values for all users
in the database. These limits were found by building
a frequency vector for the distance values, and then
dividing the distance values into four equal groups
based on their magnitude, and assigning a symbol to
each group. The limits for the distance classes were
then used globally on the signature database to ex-
plore this approach’s merit. The initial limits for the
four classes for time and path length distances, and
for pen-up occurrences were:
Time: 0 − 7; 8− 15; 16− 24; > 24
Path Length: 0− 25; 26− 60; 61− 120; > 120
Pen-Up: < 8; > 8
String representations of signatures were gener-
ated using the distance characters and the turning
point characters. Performance evaluation of the tech-
nique using the time interval and the path length as
the distance metric resulted in overall error rates of
32.2% and 33.9% respectively (see Table 3 and Ta-
ble 4).
The above classes used in developing representa-
tions for the distance and the pen-up time were based
on heuristics. Since the distances and the pen-up
times tend to be quite different for different individu-
als, it was decided to select the classes based on typi-
cal distance values for each individual. To do this, five
reference signatures for each individual were used,
and all of the distance values in between turning point
characters were obtained. These distance values were
divided into four equal groups according to their mag-
nitude and each group assigned a symbol. The small
and large pen-up time intervals were also found in
a similar fashion by dividing the pen-up time sizes
into two equal groups according to magnitude. The
performance was evaluated using the individualised
intervals. The results obtained were an 19.8% over-
Table 6: Error rates using the characters A, B, C, D, P, I,
J, K, L... and using the path length as the distance metric
(distance classes set automatically).
r F-accept G-reject FAR FRR ER
0.0 25/325 102/609 7.7% 16.7% 24.4%
0.5 36/325 58/609 11.1% 9.5% 20.6%
1.0 48/325 37/609 14.8% 6.1% 20.9%
1.5 66/325 27/609 20.3% 4.4% 24.7%
2.0 81/325 13/609 24.9% 2.1% 27.0%
all error rate using the time interval as the distance
metric and 20.6% using the path length. Table 5 and
Table 6 show that the best results are obtained when
the threshold value is 0.5. Although these results are
much better than those in Table 3, the FAR is still
high. From the results presented thus far, the time in-
terval seems to be the superior distance metric of the
two. Therefore we only concentrate on the time inter-
val as the distance metric from this point on.
The results show that the FAR was still dominat-
ing the overall error rate indicating that shape was still
the determining factor in the technique, since includ-
ing the distance only improved the shape representa-
tion, but did not include any motion or dynamic in-
formation. One type of information that can be easily
derived is the direction of pen motion at each moment.
It was decided to incorporate details of the direction
of the pen tip in between the valley and peak charac-
ters. To include pen tip direction, the motion direc-
tion was classified into four classes, corresponding to
quadrants on a Cartesian grid. The character to in-
clude between the turning points α, and the turning
point β, can then be found by positioning α at the ori-
gin and determining which quadrant β lies in.
Both the direction and the distance were included
in the signature representation by using one symbol
to represent both. To do this, for example, quadrant
one and distance class one is represented by the char-
acter E, quadrant two and the distance class one is the
character F and so on, until there was a unique charac-
ter for each of the sixteen direction/distance combina-
tions. Table 7 presents the results of evaluating these
two techniques. The results are much better with the
minimum total error at 9.2%.
An extension of the distance/direction approach
involved attaching more weight to longer strokes.
This was done by inserting not only the character ap-
propriate for each stroke, but also the distance char-
acteristic is considered, and the four distance classes
(in order) are represented by the characters W, X, Y
and Z. If the distance between two valleys/peaks falls
into the second largest class Y, then the three char-
acters WXY would be inserted in between the val-
ley/peak characters. This approach was considered
SECRYPT 2007 - International Conference on Security and Cryptography
54