Table 1: The comparison of robustness P
s
and accuracy (E
pan
, E
tilt
, E
roll
, E
avg
) between our method and state-of-the-art on
uniform-light set of BUFT dataset.
Approach P
s
E
pan
E
tilt
E
roll
E
avg
(Cascia et al., 2000) 75% 5.3 5.6 3.8 3.9
(Xiao et al., 2003) 100% 3.8 3.2 1.4 2.8
(Lefevre and Odobez, 2009) 100% 4.4 3.3 2.0 3.2
(Morency et al., 2008) 100% 5.0 3.7 2.9 3.9
(Saragih et al., 2011) 92% 3.9 3.9 2.3 3.4
Mahalanobis Distance (Chen and Davoine, 2006) 85% 5.1 3.8 2.0 3.6
PCA + Mahalanobis Distance 87% 5.2 3.5 2.0 3.6
Our model 89% 4.7 3.5 2.1 3.4
variations of appearance around landmarks. Although
performance is improved by our method, it remains
quite far from state-of-the-art methods. However, it
could be more efficient if the dynamical model or
state evolution was taken into account. Or the weights
of contribution to energy function were dependent on
the confidence of landmark observations at each time.
Finally the observation model could also be adapted
to changes through frames and be made more robust
for face tracking and pose estimation.
ACKNOWLEDGEMENTS
This work was financially supported by the ANR-
CONTINT-ORIGAMI2 project (ANR-10-CORD-
0016) and the LTCI of the Telecom-ParisTech
Institute, France.
REFERENCES
Ahlberg, J. (2001). Candide-3 - an updated parameterised
face. Technical report, Dept. of Electrical Engineer-
ing, Linkping University, Sweden.
Cascia, M. L., Sclaroff, S., and Athitsos, V. (2000). Fast,
reliable head tracking under varying illumination: An
approach based on registration of texture-mapped 3d
models. IEEE Trans. PAMI, 22(4):322–336.
Chen, Y. and Davoine, F. (2006). Simultaneous tracking of
rigid head motion and non-rigid facial animation by
analyzing local features statistically. In BMVC.
Cootes, T. F., Edwards, G. J., and Taylor, C. J. (1998). Ac-
tive appearance models. TPAMI, pages 484–498.
Cootes, T. F. and Taylor, C. J. (1992). Cj.taylor, ”active
shape models - ”smart snakes. In BMVC.
DeCarlo, D. and Metaxas, D. N. (2000). Optical flow con-
straints on deformable models with applications to
face tracking. IJCV, 38(2):99–127.
Dementhon, D. F. and Davis, L. S. (1995). Model-based
object pose in 25 lines of code. IJCV, 15:123–141.
Elad, M. and Aharon, M. (2006). Image denoising via
sparse and redundant representations over learned dic-
tionaries. IEEE Transactions on Image Processing,
15(12):3736–3745.
Gross, R., Matthews, I., and Baker, S. (2006). Active ap-
pearance models with occlusion. IVC, 24(6):593–604.
Lefevre, S. and Odobez, J.-M. (2009). Structure and ap-
pearance features for robust 3d facial actions tracking.
In ICME.
Mei, X. and Ling, H. (2011). Robust visual tracking and
vehicle classification via sparse representation. IEEE
Trans. PAMI, 33(11):2259–2272.
Morency, L.-P., Whitehill, J., and Movellan, J. R. (2008).
Generalized adaptive view-based appearance model:
Integrated framework for monocular head pose esti-
mation. In FG.
Nelder, J. A. and Mead, R. (1965). A simplex algorithm
for function minimization. Computer Journal, pages
308–313.
Saragih, J. M., Lucey, S., and Cohn, J. F. (2011). De-
formable model fitting by regularized landmark mean-
shift. IJCV, 91:200–215.
Tibshirani, R. (1996). Regression shrinkage and selection
via the lasso. Journal of the Royal Statistical Society
(Series B), 58:267–288.
Vacchetti, L., Lepetit, V., and Fua, P. (2004). Stable real-
time 3d tracking using online and offline information.
IEEE Trans. PAMI, 26(10):1385–1391.
Wright, J., Yang, A., Ganesh, A., Sastry, S., and Ma, Y.
(2009). Robust face recognition via sparse represen-
tation. TPAMI, 31(2):210 –227.
Xiao, J., Baker, S., Matthews, I., and Kanade, T. (2004).
Real-time combined 2d+3d active appearance models.
In CVPR.
Xiao, J., Moriyama, T., Kanade, T., and Cohn, J. (2003).
Robust full-motion recovery of head by dynamic tem-
plates and re-registration techniques. International
Journal of Imaging Systems and Technology, 13:85 –
94.
Yang, A. Y., Ganesh, A., Zhou, Z., Sastry, S., and Ma, Y.
(2010). A review of fast l1-minimization algorithms
for robust face recognition. CoRR, abs/1007.3753.
Yb
´
a
˜
nez-Zepeda, J. A., Davoine, F., and Charbit, M. (2007).
Local or global 3d face and facial feature tracker. In
ICIP, volume 1, pages 505–508.
3DFacePoseTrackingfromMonocularCameraviaSparseRepresentationofSynthesizedFaces
333