Table 4: Analysis of performance with varying number of
neighbors for embedding.
Number of Error using Error using
Neighbors traditional Biased
Isomap Isomap
30 11.56 5.10
50 12.96 5.06
100 13.83 5.03
200 12.59 5.06
500 14.36 5.07
6 CONCLUSION
We have proposed the Biased Manifold Embedding
method, a novel supervised approach to manifold
learning techniques for regression problems. The
proposed method was validated for accurate person-
independent head pose estimation. The use of pose
information in the manifold embedding process im-
proved the performance of the pose estimation pro-
cess significantly. The pose angle estimates obtained
using this method are accurate, and can be relied upon
with an error margin of 3-4
◦
. Our experiments also
demonstrated that the method is robust to variations in
feature spaces, dimensionality of embedding and the
choice of the number of neighbors for the embedding.
The proposed method can easily be extended from the
current Isomap implementation to cover the envelop
of other manifold learning techniques, and can be de-
veloped as a framework for biased manifold learning
to cater to all regression problems at large.
6.1 Limitations and Future Work
As mentioned earlier, a significant drawback of man-
ifold learning techniques is the lack of a projection
matrix to treat new data points. While we used the
GRNN to learn the non-linear mapping in this work,
there have been other approaches adopted by various
researchers. Bengio et al (Bengio et al., 2004) pro-
posed a mathematical formulation focussed to over-
come this problem. We plan to use these approaches
to support the validity of our approach. Besides, we
intend to extend the Biased Manifold Embedding im-
plementation to LLE and Laplacian Eigenmaps to es-
tablish it as a framework for non-linear dimensional-
ity reduction in regression applications. On a lesser
significant note, another limitation of the current ap-
proach is that the number of neighbors chosen to ob-
tain the embeddding has to be more than the num-
ber of individuals in the face images. This is because
different individuals with the same pose angle are as-
signed a zero distance value in the biased geodesic
distance matrix. We plan to modify our algorithm to
overcome this limitation. In addition, the function of
pose distance used to bias the geodesic distance ma-
trix can be varied to study the applicability of different
reciprocal functions for pose estimation.
REFERENCES
Bengio, Y., Paiement, J. F., Vincent, P., and Delalleau, O.
(2004). Out-of-sample extensions for lle, isomap,
mds, eigenmaps, and spectral clustering.
Brown, L. and Tian, Y.-L. (2002). Comparative study of
coarse head pose estimation.
Chen, L., Zhang, L., Hu, Y. X., Li, M. J., and Zhang, H. J.
(2003). Head pose estimation using fisher manifold
learning. In IEEE International Workshop on Analy-
sis and Modeling of Faces and Gestures (AMFG03),
pages 203–207, Nice, France.
Fu, Y. and Huang, T. S. (2006). Graph embedded analysis
for head pose estimation. In 7th International Con-
ference on Automatic Face and Gesture Recognition,
Southampton, UK.
Hu, N., Huang, W., and Ranganath, S. (2005). Head pose
estimation by non-linear embedding and mapping. In
IEEE International Conference on Image Processing,
page 342345, Genova.
Little, G., Krishna, S., Black, J., and Panchanathan, S.
(2005). A methodology for evaluating robustness
of face recognition algorithms with respect to vari-
ations in pose and illumination angle. In Proceed-
ings of IEEE International Conference on Acoustics,
Speech and Signal Processing, page 8992, Philadel-
phia, USA.
Raytchev, B., Yoda, I., and Sakaue, K. (2004). Head
pose estimation by nonlinear manifold learning. In
17th International Conference on Pattern Recognition
(ICPR04), Cambridge, UK.
Ridder, D. d., Kouropteva, O., Okun, O., Pietikainen, M.,
and Duin, R. P. (2003). Supervised locally linear em-
bedding. In International Conference on Artificial
Neural Networks and Neural Information Processing,
333341.
Tanenbaum, J. B., Silva, V. d., and Langford, J. C. (2000).
A global geometric framework for nonlinear dimen-
sionality reduction. Science, 290(5500):23192323.
Wenzel, M. T. and Schiffmann, W. H. (2005). Head pose
estimation of partially occluded faces. In Second
Canadian Conference on Computer and Robot Vision
(CRV05), page 353360, Victoria, Canada.
Yu, J. and Tian, Q. (2006). Learning image manifolds by
semantic subspace projection. In ACM Multimedia,
Santa Barbara, CA, USA.
Zhao, Q., Zhang, D., and Lu, H. (2005). Supervised lle in
ica space for facial expression recognition. In Interna-
tional Conference on Neural Networks and Brain (IC-
NNB05), volume 3, page 19701975, Beijing, China.
VISAPP 2007 - International Conference on Computer Vision Theory and Applications
82