In the considered case in the paper, for eight
component face contours of interest, n = 194 such
points have been determined (Fig. 1a and Tab. 1)
and this implies 388 dimensional shape space.
2.1 Extracting and Calculation of
Contours
The stages of applied procedure to obtain normalised
contours are presented in Fig. 1. Images used to
contour extracting are presented in Fig. 2. First, two
landmarks are positioned on the face image, in the
external eye corners. Next, the initial contour
(template) is placed on the image, according to the
landmark positions (Fig. 1a). The landmarks
determine a face pose and an image scale. In the
next step, the contour is manually drawing (fitting)
on place, which seems to the operator as the best
localisation for the contour point positions (Fig. 1b).
Subsequently, the derived contours (Fig. 1c) are
normalised. Scale coefficient results from the
calculated coordinates of eye centres (pupils). This
is connected to applied active shape procedure,
where the initial contour is generally placed
according to expected pupils positions. Pupil
coordinates are calculated from coordinates of
contour samples located in the eyelid corners. X-axis
is determined by pupil coordinates; points (-1,0),
(1,0) are located on right and left pupil, respectively.
The symmetrical of this section determines Y-axis
and the middle of coordinate system. Next, the
contour points are projected on a normalised
coordinate system. The normalised contour has to be
uniformly sampled (manually extracted contours
have nonuniform distances between the adjoining
points). The normalised and uniformly sampled
contours are presented in Fig. 1d. During the
normalisation procedure, points are ordered in
a defined sequence, according to the feature vector
definition (1). In the presented approach, the height
standardizations of face and nose outlines have not
been applied (Fig. 3).
3 EXPERIMENT
In order to select classifiers for ASM method, an
experiment consisting of examining a set of face
images was undertaken. Color images of 2048 ×
1536 pixels were used. For 100 persons (N = 100
classes) the following images were taken:
A – sequence of 30 frames for horizontal head
rotation from the right to the left half-profile;
B – sequence of 20 frames for vertical face rotation
from slightly risen to hanged down head
position;
C – 10 frames for different head position and
limited face mimicry.
The contours were prepared by over a dozen
persons. A person chosen to work on C-frames has
not seen the contours resulting from A and B frames.
The contours were positioned on 11 internal images
from A-frames, on 11 internal images from B-frames
and on 5 images chosen from C-frames (Fig. 2). In
presented experiment 2700 contours were used.
3.1 Set Definitions
The normalised contours were divided into the
following sets:
LS22 – learning set, 2200 contours, 22 for each
class from A- and B-frames;
LS11A – learning set, 1100 contours, 11 for
each class, even subset of LS22;
LS11B – learning set, 1100 contours, 11 for
each class, odd subset of LS22;
LS06 – learning set, 600 contours, 6 for each
class, subset of LS22 with face poses nearest
to en face position;
VS05 – validation set, 500 contours, 5 for each
person from C-frames.
LS11A and LS11B sets were used as learning or
testing sets alternatively.
3.2 Classifiers
Two classifying methods were tested. The first
classifier was Nearest Neighbourhood Classifier in
reduced shape subspace derived from LDA. As
a metric, Euclidean distance to a model of class in
99-dimensional subspace was used. The second
method was taken as the SVM method with kernel
such as Radial Basis Function. The classification of
x sample from testing or learning sets was based on
a voting procedure. In presented approach, a total
number of votes is equal to N (N - 1)/2, where N is
the number of classes. The maximal number of votes
to one class is equal to (N - 1) and in our experiment
it is only 2% of total number of votes. The voting
decision depends only on the sign of discrimination
function for x sample coordinates. In the case of
a pair of “very similar classes”, only one vote from
(N - 1) decisions can decide. In the presented
CLASSIFIER SELECTION FOR FACE RECOGNITION ALGORITHM BASED ON ACTIVE SHAPE MODEL
277