Bulat, A. and Tzimiropoulos, G. (2017). How far are
we from solving the 2d & 3d face alignment prob-
lem? (and a dataset of 230,000 3d facial landmarks).
In Proceedings of the IEEE International Conference
on Computer Vision, pages 1021–1030.
Cao, X., Wei, Y., Wen, F., and Sun, J. (2014). Face align-
ment by explicit shape regression. International jour-
nal of computer vision, 107(2):177–190.
Chang, F.-J., Tuan Tran, A., Hassner, T., Masi, I., Nevatia,
R., and Medioni, G. (2017). Faceposenet: Making a
case for landmark-free face alignment. In Proceed-
ings of the IEEE International Conference on Com-
puter Vision Workshops, pages 1599–1608.
Chen, D., Ren, S., Wei, Y., Cao, X., and Sun, J. (2014).
Joint cascade face detection and alignment. In Euro-
pean conference on computer vision, pages 109–122.
Springer.
DeMenthon, D. F. and Davis, L. S. (1995). Model-based
object pose in 25 lines of code. International journal
of computer vision, 15(1-2):123–141.
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-
Fei, L. (2009). Imagenet: A large-scale hierarchical
image database. In 2009 IEEE conference on com-
puter vision and pattern recognition, pages 248–255.
IEEE.
Deng, J., Guo, J., Zhou, Y., Yu, J., Kotsia, I., and Zafeiriou,
S. (2019). Retinaface: Single-stage dense face locali-
sation in the wild. arXiv preprint arXiv:1905.00641.
Fanelli, G., Weise, T., Gall, J., and Van Gool, L. (2011).
Real time head pose estimation from consumer depth
cameras. In Joint pattern recognition symposium,
pages 101–110. Springer.
Gao, S., Cheng, M.-M., Zhao, K., Zhang, X.-Y., Yang, M.-
H., and Torr, P. H. (2019). Res2net: A new multi-scale
backbone architecture. IEEE transactions on pattern
analysis and machine intelligence.
Gu, J., Yang, X., De Mello, S., and Kautz, J. (2017). Dy-
namic facial analysis: From bayesian filtering to re-
current neural network. In Proceedings of the IEEE
conference on computer vision and pattern recogni-
tion, pages 1548–1557.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep resid-
ual learning for image recognition. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 770–778.
Hinton, G., Vinyals, O., and Dean, J. (2015). Distilling
the knowledge in a neural network. arXiv preprint
arXiv:1503.02531.
Hsu, G.-S., Huang, W.-F., and Yap, M. H. (2019). Edge-
embedded multi-dropout framework for real-time face
alignment. IEEE Access, 8:6032–6044.
Huang, J., Shao, X., and Wechsler, H. (1998). Face pose
discrimination using support vector machines (svm).
In Proceedings. fourteenth international conference
on pattern recognition (Cat. No. 98EX170), volume 1,
pages 154–156. IEEE.
Jones, M. and Viola, P. (2003). Fast multi-view face detec-
tion. Mitsubishi Electric Research Lab TR-2003-96,
3(14):2.
Kazemi, V. and Sullivan, J. (2014). One millisecond face
alignment with an ensemble of regression trees. In
Proceedings of the IEEE conference on computer vi-
sion and pattern recognition, pages 1867–1874.
Kumar, A., Alavi, A., and Chellappa, R. (2017). Kepler:
Keypoint and pose estimation of unconstrained faces
by learning efficient h-cnn regressors. In 2017 12th
ieee international conference on automatic face &
gesture recognition (fg 2017), pages 258–265. IEEE.
Lathuilière, S., Juge, R., Mesejo, P., Munoz-Salinas, R., and
Horaud, R. (2017). Deep mixture of linear inverse
regressions applied to head-pose estimation. In Pro-
ceedings of the IEEE Conference on Computer Vision
and Pattern Recognition, pages 4817–4825.
Martin, M., Van De Camp, F., and Stiefelhagen, R. (2014).
Real time head model creation and head pose estima-
tion on consumer depth cameras. In 2014 2nd Inter-
national Conference on 3D Vision, volume 1, pages
641–648. IEEE.
Meyer, G. P., Gupta, S., Frosio, I., Reddy, D., and Kautz, J.
(2015). Robust model-based 3d head pose estimation.
In Proceedings of the IEEE international conference
on computer vision, pages 3649–3657.
Mukherjee, S. S. and Robertson, N. M. (2015). Deep head
pose: Gaze-direction estimation in multimodal video.
IEEE Transactions on Multimedia, 17(11):2094–
2107.
Murphy-Chutorian, E., Doshi, A., and Trivedi, M. M.
(2007). Head pose estimation for driver assistance
systems: A robust algorithm and experimental evalua-
tion. In 2007 IEEE intelligent transportation systems
conference, pages 709–714. IEEE.
Murphy-Chutorian, E. and Trivedi, M. M. (2008). Head
pose estimation in computer vision: A survey. IEEE
transactions on pattern analysis and machine intelli-
gence, 31(4):607–626.
Ng, J. and Gong, S. (2002). Composite support vector ma-
chines for detection of faces across views and pose es-
timation. Image and Vision Computing, 20(5-6):359–
368.
Niyogi, S. and Freeman, W. T. (1996). Example-based head
tracking. In Proceedings of the second international
conference on automatic face and gesture recognition,
pages 374–378. IEEE.
Ranjan, R., Patel, V. M., and Chellappa, R. (2017a). Hy-
perface: A deep multi-task learning framework for
face detection, landmark localization, pose estimation,
and gender recognition. IEEE transactions on pattern
analysis and machine intelligence, 41(1):121–135.
Ranjan, R., Sankaranarayanan, S., Castillo, C. D., and Chel-
lappa, R. (2017b). An all-in-one convolutional neu-
ral network for face analysis. In 2017 12th IEEE In-
ternational Conference on Automatic Face & Gesture
Recognition (FG 2017), pages 17–24. IEEE.
Ruiz, N., Chong, E., and Rehg, J. M. (2018). Fine-grained
head pose estimation without keypoints. In Proceed-
ings of the IEEE conference on computer vision and
pattern recognition workshops, pages 2074–2083.
Schwarz, A., Haurilet, M., Martinez, M., and Stiefelhagen,
R. (2017). Driveahead-a large-scale driver head pose