data:image/s3,"s3://crabby-images/f6db0/f6db0dd848ac495fdaf5d52e18e8fe5fad1e0d1f" alt=""
Face Blending Data Augmentation for Enhancing Deep Classification
Emna Ghorbel
a
, Ghada Maddouri and Faouzi Ghorbel
CRISTAL Laboratory, GRIFT Research Group ENSI, La Manouba University 2010, La Manouba, Tunisia
Keywords:
Low Size Dataset, Face Data Augmentation, InceptionV3, Face Classification.
Abstract:
Facial image classification plays a vital role in computer vision applications, particularly in face recogni-
tion. Convolutional Neural Networks have excelled in this domain, however, their performance decline when
dealing with small facial datasets. In that context, data augmentation methods have been proposed. In line
with this, we introduce the Face Blending data augmentation method, which augments intra-class variability
while preserving image semantics. By interpolating faces, we generate non-linear deformations, resulting in
in-between images that maintain the original’s global aspect. Results show that Face Blending significantly
enhances facial classification. Comparisons with Mix-up and Random Erasing techniques reveal improved
accuracy, precision, recall, and F1 score, particularly with limited datasets. This method offers promise for
realistic applications contributing to more reliable and accurate facial classification systems with limited data.
1 INTRODUCTION
In the field of computer vision, Convolutional Neu-
ral Networks (CNNs) have made significant strides
in the recognition and classification of facial images.
In fact, CNNs have demonstrated their effectiveness
in a wide range of applications within facial analy-
sis, whether it involves 2D or 3D facial structures,
grayscale, or color images. Nevertheless, the perfor-
mance of CNNs can decline when confronted with the
challenges of small-scale facial datasets. The learn-
ing phase of neural network models demands copi-
ous data for convergence, and such datasets, in prac-
tical applications, often fall short. To address this
limitation, several data augmentation methods have
been proposed (Summers and Dinneen, 2019; Inoue,
2018; Kang et al., 2017; Zhong et al., 2020; Gatys
et al., 2015; Konno and Iwazume, 2018; Bowles et al.,
2018; Su et al., 2019; El-Sawy et al., 2016; Patel et al.,
2019; Ciregan et al., 2012; Sato et al., 2015; Patel
et al., 2019; Yin et al., 2019; Paulin et al., 2014; Chat-
field et al., 2014). These techniques can be classified
into three distinct categories. The first one encom-
passes the geometric-based methods as the similari-
ties’ transformations (scale, rotation, translation, and
flipping) on images (Ciregan et al., 2012; Sato et al.,
2015; Simard et al., 2003), Part-based method apply-
ing linear and affine transformations on shape parts
after a cut detection process (Patel et al., 2019), and
a
https://orcid.org/0000-0002-6179-1358
the Shape morphing based technique for augmenting
2D shapes (Ghorbel et al., 2023). The second cate-
gory is driven by deep learning as Neural Style Trans-
fer data augmentation (Gatys et al., 2015), Features
space based on Auto-Encoders (Konno and Iwazume,
2018), Generative Adversarial Neuronal (GANs) ar-
chitectures (Bowles et al., 2018). The third cate-
gory operates at the pixel level, deploying techniques
such as kernel filters (Kang et al., 2017), homogra-
phy (Paulin et al., 2014), mixing pixels (Summers
and Dinneen, 2019; Inoue, 2018), and random eras-
ing techniques (Yin et al., 2019; Zhong et al., 2020).
Despite their contributions, adapting these aug-
mentation methods for facial classification can be es-
pecially challenging, given the nuanced complexities
involved in this task. Often, these methods prove in-
adequate in capturing the nuances of intra-class vari-
ations, which may lead to the loss of significant facial
details in the process.
In this paper, we introduce a novel data augmenta-
tion technique meticulously crafted to enhance CNN
performance in facial classification. Our method aims
to augment intra-class variability while preserving the
semantic integrity of facial images. We present the
Face Blending data augmentation, tailored specifi-
cally to improve facial classification tasks. This pa-
per explores the theoretical foundations, integration
of Face Blending into CNN architecture, and its ap-
plication to three small-scale facial datasets. Our ob-
jective is to provide compelling evidence that this in-
274
Ghorbel, E., Maddouri, G. and Ghorbel, F.
Face Blending Data Augmentation for Enhancing Deep Classification.
DOI: 10.5220/0012357900003654
Paper published under CC license (CC BY-NC-ND 4.0)
In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods (ICPRAM 2024), pages 274-280
ISBN: 978-989-758-684-2; ISSN: 2184-4313
Proceedings Copyright © 2024 by SCITEPRESS – Science and Technology Publications, Lda.