Table 3: Accuracy vs. normalized error in the Gi4E database.
Method Accuracy (%)
e ≤ 0.025 e ≤ 0.05 e ≤ 0.1
DeepPupil Net 98.37 99.91 100
Lee
2020
(Lee et al., 2020) 93.00
?
99.84 99.84
Choi
2020
(Choi et al., 2020) 90.40 99.60 99.84
Xia
2019
(Xia et al., 2019) 70.00
?
99.10 100
Xiao
2018
(Xiao et al., 2018) 70.00
?
97.90 100
Levinshtein
2018
(Levinshtein et al., 2018) 88.34 99.27 99.92
Cai
2018
(Cai et al., 2018) 85.70 99.50 -
Table 4: DeepPupil Net performance for different architectures in the BioID database.
Network Stages Number of Parameters Time Accuracy (%)
e ≤ 0.05 e ≤ 0.1 e ≤ 0.25
2 0.43M 13.2ms 94.51 99.50 100
3 1.65M 15.0ms 98.00 100 100
4 6.57M 18.0ms 97.86 100 100
work achieves real-time performance as it requires
only 15ms, in Matlab implementation, to process both
the eyes for every input image.
5 CONCLUSIONS
In this paper, the DeepPupil Net, a FCN that solved
in an accurate and robust manner the eye center lo-
calization problem is introduced. This network con-
sists of an encoder-decoder based architecture and
was trained end-to-end to localize precisely the eye
centers even in the most challenging circumstances.
An extensive evaluation of the proposed method on
three publicly available databases demonstrated a sig-
nificant improvement in accuracy over state-of-the-art
techniques. Moreover, due to its reduced processing
time, DeepPupil Net can be incorporated in low-cost
eye trackers, where the real-time performance is pre-
requisite.
REFERENCES
Anjith, G. and Routray, A. (2016). Fast and accurate al-
gorithm for eye localization for gaze tracking in low
resolution images. arXiv preprint arXiv:1605.05272.
Cai, H., Liu, B., Ju, Z., Thill, S., Belpaeme, T., Vander-
borght, B., and Liu, H. (2018). Accurate eye center
localization via hierarchical adaptive convolution. In
British Machine Vision Conference (BMVC). British
Machine Vision Association.
Cai, H., Liu, B., Zhang, J., Chen, S., and Liu, H. (2017).
Visual focus of attention estimation using eye center
localization. IEEE Systems Journal, 11.
Campadelli, P., Lanzarotti, R., and Lipori, G. (2009). Pre-
cise eye and mouth localization. Int. Journal of Pat-
tern Recognition and Artificial Intelligence, 23:359–
377.
Chapelle, O. and Wu, M. (2010). Gradient descent opti-
mization of smoothed information retrieval metrics.
Information retrieval, 13:216–235.
Choi, J., Lee, K., and Song, B. (2020). Eye pupil localiza-
tion algorithm using convolutional neural networks.
Multimedia Tools and Applications, 79:32563–32574.
Everingham, M. and Zisserman, A. (2006). Regression and
classification approaches to eye localization in face
images. In International Conference on Automatic
Face and Gesture Recognition, pages 441–446. IEEE.
He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep resid-
ual learning for image recognition. In International
Conference on Computer Vision and Pattern Recogni-
tion, pages 770–778. IEEE.
Jesorsky, O., Kirchbergand, K. J., and Frischholz, R.
(2001). Robust face detection using the hausdorff dis-
tance. In Audio and Video Biom. Pers. Authentication,
pages 90–95.
Kar, A. and Corcoran, P. (2017). A review and analysis
of eye-gaze estimation systems, algorithms and per-
formance evaluation methods in consumer platforms.
IEEE Access, 5:16495–16519.
Kingma, D. and Ba, J. (2015). Adam: A method for
stochastic optimization. In IEEE International Con-
ference on Learning Representations. IEEE.
Krafka, K., Khosla, A., Kellnhofer, P., Kannan, H., Bhan-
darkar, S., Matusik, W., and Torralba, A. (2016). Eye
tracking for everyone. In International Conference
on Computer Vision and Pattern Recognition (CVPR),
pages 2176–2184. IEEE.
Lee, K., Jeon, J., and Song, B. (2020). Deep learning-based
pupil center detection for fast and accurate eye track-
ing system. In European Conference on Computer Vi-
sion (ECCV), pages 36–52. Springer.
DeepPupil Net: Deep Residual Network for Precise Pupil Center Localization
303