
shape priors for image labeling. 2013 IEEE Con-
ference on Computer Vision and Pattern Recognition,
pages 2019–2026.
Kuang, Z. and Tie, X. (2021). Flow-based video seg-
mentation for human head and shoulders. ArXiv,
abs/2104.09752.
Le, V., Brandt, J., Lin, Z. L., Bourdev, L. D., and Huang,
T. S. (2012). Interactive facial feature localization. In
European Conference on Computer Vision.
Lee, C.-H., Liu, Z., Wu, L., and Luo, P. (2019). MaskGAN:
Towards diverse and interactive facial image manip-
ulation. 2020 IEEE/CVF Conference on Computer
Vision and Pattern Recognition (CVPR), pages 5548–
5557.
Li, J., Ma, S., Zhang, J., and Tao, D. (2021). Privacy-
preserving portrait matting. Proceedings of the 29th
ACM International Conference on Multimedia.
Liang, J., Zeng, H., Cui, M., Xie, X., and Zhang, L.
(2021). PPR10K: A large-scale portrait photo retouching
dataset with human-region mask and group-level
consistency. 2021 IEEE/CVF Conference on Com-
puter Vision and Pattern Recognition (CVPR), pages
653–661.
Lin, S., Ryabtsev, A., Sengupta, S., Curless, B., Seitz, S. M.,
and Kemelmacher-Shlizerman, I. (2020). Real-time
high-resolution background matting. 2021 IEEE/CVF
Conference on Computer Vision and Pattern Recogni-
tion (CVPR), pages 8758–8767.
Lin, T.-Y., Dollár, P., Girshick, R., He, K., Hariharan, B.,
and Belongie, S. (2017). Feature pyramid networks
for object detection.
Lin, Y., Shen, J., Wang, Y., and Pantic, M. (2021). RoI Tanh-polar
transformer network for face parsing in the wild.
Image Vis. Comput., 112:104190.
Liu, Y., Shi, H., Shen, H., Si, Y., Wang, X., and Mei, T.
(2020). A new dataset and boundary-attention seman-
tic segmentation for face parsing. In AAAI Conference
on Artificial Intelligence.
Liu, Z., Luo, P., Wang, X., and Tang, X. (2015). Deep learn-
ing face attributes in the wild.
Long, J., Shelhamer, E., and Darrell, T. (2014). Fully con-
volutional networks for semantic segmentation. 2015
Proceedings of the IEEE Conference on Computer Vision
and Pattern Recognition (CVPR), pages 3431–3440.
Long, J., Shelhamer, E., and Darrell, T. (2015). Fully con-
volutional networks for semantic segmentation.
Loshchilov, I. and Hutter, F. (2017). Decoupled
weight decay regularization. 2019 The International
Conference on Learning Representations (ICLR),
abs/1711.05101.
Luo, L., Xue, D., and Feng, X. (2020). EHANet: An effective
hierarchical aggregation network for face parsing.
Applied Sciences, 10(9):3135.
Park, H., Sjösund, L. L., Monet, N., Yoo, Y., and Kwak,
N. (2019a). SINet: Extreme lightweight portrait
segmentation networks with spatial squeeze modules
and information blocking decoder. arXiv preprint
arXiv:1911.09099.
Park, H., Sjösund, L. L., Yoo, Y., and Kwak, N. (2019b).
ExtremeC3Net: Extreme lightweight portrait segmentation
networks using advanced C3-modules. arXiv
preprint arXiv:1908.03093.
Poudel, R. P. K., Liwicki, S., and Cipolla, R. (2019). Fast-SCNN:
Fast semantic segmentation network.
Ryumina, E., Dresvyanskiy, D., and Karpov, A. (2022).
In search of a robust facial expressions recognition
model: A large-scale visual cross-corpus study. Neu-
rocomputing.
Sander, E. L. J. (2020). Coronavirus could spark a revolution
in working from home: Are we ready? The
Conversation.
Shen, X., Hertzmann, A., Jia, J., Paris, S., Price, B. L.,
Shechtman, E., and Sachs, I. (2016). Automatic por-
trait segmentation for image stylization. Computer
Graphics Forum, 35.
Taherkhani, F., Nasrabadi, N. M., and Dawson, J. (2018).
A deep face identification network enhanced by facial
attributes prediction.
Wood, E., Baltrušaitis, T., Hewitt, C., Dziadzio, S., Johnson, M.,
Estellers, V., Cashman, T. J., and Shotton, J.
(2021). Fake it till you make it: face analysis in the
wild using synthetic data alone. 2021 IEEE/CVF In-
ternational Conference on Computer Vision (ICCV),
pages 3661–3671.
Xie, E., Wang, W., Yu, Z., Anandkumar, A., Álvarez, J. M.,
and Luo, P. (2021). SegFormer: Simple and efficient
design for semantic segmentation with transformers.
In Neural Information Processing Systems.
Yin, X. and Chen, L. L. (2022). FaceOcc: A diverse,
high-quality face occlusion dataset for human face extraction.
ArXiv, abs/2201.08425.
Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N.
(2018). BiSeNet: Bilateral segmentation network for
real-time semantic segmentation. In European Con-
ference on Computer Vision.
Zhu, X., Liu, X., Lei, Z., and Li, S. Z. (2017). Face alignment
in full pose range: A 3D total solution. IEEE
Transactions on Pattern Analysis and Machine
Intelligence.
APPENDIX
In the supplementary materials, we include illustrations
of selected samples with their corresponding
annotation masks from the EasyPortrait dataset, the
distributions of head turns across different datasets, the
guidelines for class annotations, and the evaluation
results on the EasyPortrait dataset.
VISAPP 2025 - 20th International Conference on Computer Vision Theory and Applications