Region of intrest
−2
−1
0
A priori landscape
−30
−20
−10
Likelihood landscape
−100
−80
−60
−40
−20
MAP landscape
−150
−100
−50
Figure 4: The top row shows an eye on the left and on
the right the a priori landscape is presented. The bottom
row presents the likelihood landscape on the left and finally
the MAP landscape. The blue circles denote the maximum
in that landscape while red dots are the landscape maxima,
shown for easy reference.
In Figure 4 a detailed example for one landmark,
an eye, can be seen. In the upper right corner we see
a region of interest for the eye. The upper right shows
the a priori landscape for this landmark. The lower
left corner shows the likelihood-ratio landscape. This
is the sum of the first two terms of Equation 9. Finally,
in the lower right corner the resulting MAP landscape
is shown. In each landscape the location of the max-
imum value is denoted by a large blue circle. In each
of the four images the maxima of the three landscapes
are plotted as a red dot for easy reference. The max-
ima of the likelihood ratio landscape and the MAP
landscape are shown as the two red dots. It can clearly
be seen that because of the influence of the q(
~
s) MAP
gives a better estimate of the centre of the eye. It is,
however, also true that the final influence of the q(
~
s)
term is not always as substantial as in Figure 4. MAP
improves the result here because the likelihood land-
scape is rather uniform in the entire eye region. It
is reasonably save to assume that a part of the im-
provement with regards to MLLL and BILBO can be
attributed to better implementation of the feature ex-
traction then the one in the original MLLL. If a likeli-
hood ratio landscape has a sharp maximum, as in high
quality images, there is not much contribution of the
shape probability to the final MAP landscape.
Finally, two interesting observations can be made.
First, applying BILBO is not always beneficial. Ap-
plying BILBO to the MLLL and MAP data makes in-
creases the error for the mouth except for MLLL in
the high quality images. This can be seen in both
Table 2 and Figure 3. Possible explanations for this
effect can be that landmark locations on the mouth
are falsely corrected by BILBO because of grand er-
rors on for instance the nose. Secondly, it is clear that
BILBO does not significantly improves MAP while
it does improve the results of the MLLL algorithm.
From this we can conclude that MAP has less severe
outliers and that the limit of BILBO is being reached.
Also, with MAP the shape is already taken into ac-
count more than with MLLL alone. On the other
hand, the fact that using a shape based outlier detec-
tion sometimes still improves the results proves that
there is still room for improvement of the current im-
plementation of the MAP algorithm.
5.1.1 Future Improvements
This method can be further improved by dropping
the assumption made in Section 3 that the landmarks
are independent. This requires a more elaborate op-
timization method for solving Equation 2. Also the
quality of the training data is not sufficient. The
BioID database only contains landmarks from 22 per-
sons. Training on a different database with a bigger
variety of persons might improve the results signifi-
cantly.
Further improvement can be reached by making
an iterative implementation of the algorithm. In the
current implementation, the algorithm trained solely
on registered images. The algorithm responds works
less good on unregistered faces. This has a twofold
negative effect. First the likelihood landscape is cal-
culated for a type of image it has not been trained
to recognize. Secondly the probability distribution of
the shape assumes the head to be aligned. When the
MAP algorithm is used iteratively, an image is better
aligned each run. Thus it has a better fit to the model,
improving the overall accuracy.
6 CONCLUSIONS
We formulated a solid MAP frame work for finding
the landmarks in a facial image. Our implementation,
however, is only a first step towards a complete MAP
landmark location estimator. It shows that using the
MAP probability actually improves the performance
of the MLLL and BILBO algorithms on frontal still
images. MAP has turned out to be more robust be-
cause the performance on the low quality images im-
proved a lot, narrowing the performance gap with the
high quality images. The assumption that we made
that the landmark locations are independent is incor-
rect. The next step will be to introduce the dependen-
cies between the landmarks in order to improve the
estimates of q(
~
s). Also making an iterative implemen-
tation can improve the MAP approach. Nevertheless
the results are promising.
BIOSIGNALS 2009 - International Conference on Bio-inspired Systems and Signal Processing
16