
mAP@0.50. Among the methods evaluated, image
compositing stood out as the most effective in terms
of performance, achieving the highest precision and
recall scores, as well as the best mAP.
Our results validate the hypothesis that data aug-
mentation can significantly enhance the performance
of object detection models, even in the presence of
complex and imbalanced datasets. Moving forward,
we plan to further refine and optimize the augmen-
tation strategies, combining them with cutting-edge
techniques such as generative adversarial networks
and semi-supervised learning methods. Additionally,
extending our approach to larger datasets and applying
it across other domains, such as autonomous vehicles
and medical imaging, presents an exciting direction for
future work. Our ultimate goal is to continue advanc-
ing the state-of-the-art in object detection, improving
both model accuracy and computational efficiency.
REFERENCES
Akkem, Y., Biswas, S. K., and Varanasi, A. (2024). A com-
prehensive review of synthetic data generation in smart
farming by using variational autoencoder and genera-
tive adversarial network. Engineering Applications of
Artificial Intelligence, 131:107881.
Dou, H.-X., Lu, X.-S., Wang, C., Shen, H.-Z., Zhuo, Y.-W.,
and Deng, L.-J. (2022). Patchmask: A data augmen-
tation strategy with gaussian noise in hyperspectral
images. Remote Sensing, 14(24).
Ho, J., Jain, A., and Abbeel, P. (2020). Denoising diffusion
probabilistic models.
Hu, J., Xiao, F., Jin, Q., Zhao, G., and Lou, P. (2023). Syn-
thetic data generation based on rdb-cyclegan for indus-
trial object detection. Mathematics, 11(22).
Lafontaine, V., Bouchard, K., Maître, J., and Gaboury, S.
(2024). Generating synthetic augmentation data from
a practical uwb radar dataset using vq-vae. In Pro-
ceedings of the 2024 International Conference on In-
formation Technology for Social Good, GoodIT ’24,
page 212–215, New York, NY, USA. Association for
Computing Machinery.
Lee, J.-H., Zaheer, M. Z., Astrid, M., and Lee, S.-I. (2020).
Smoothmix: A simple yet effective data augmenta-
tion to train robust classifiers. In Proceedings of the
IEEE/CVF Conference on Computer Vision and Pat-
tern Recognition (CVPR) Workshops.
Liang, W., Liang, Y., and Jia, J. (2023). Miamix: Enhancing
image classification through a multi-stage augmented
mixed sample data augmentation method. Processes,
11(12).
Mikołajczyk, A. and Grochowski, M. (2018). Data augmen-
tation for improving deep learning in image classifica-
tion problem. In 2018 International Interdisciplinary
PhD Workshop (IIPhDW), pages 117–122.
Perez, L. and Wang, J. (2017). The effectiveness of data aug-
mentation in image classification using deep learning.
Podell, D., English, Z., Lacey, K., Blattmann, A., Dock-
horn, T., Müller, J., Penna, J., and Rombach, R. Sdxl:
Improving latent diffusion models for high-resolution
image synthesis. In The Twelfth International Confer-
ence on Learning Representations.
Puttaruksa, C. and Taeprasartsit, P. (2018). Color data aug-
mentation through learning color-mapping parameters
between cameras. In 2018 15th International Joint
Conference on Computer Science and Software Engi-
neering (JCSSE), pages 1–6.
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., and Om-
mer, B. (2021). High-resolution image synthesis with
latent diffusion models. 2022 IEEE/CVF Conference
on Computer Vision and Pattern Recognition (CVPR),
pages 10674–10685.
Rombach, R., Blattmann, A., Lorenz, D., Esser, P., and
Ommer, B. (2022). High-resolution image synthesis
with latent diffusion models. In Proceedings of the
IEEE/CVF Conference on Computer Vision and Pat-
tern Recognition (CVPR), pages 10684–10695.
Shijie, J., Ping, W., Peiyi, J., and Siping, H. (2017). Re-
search on data augmentation for image classification
based on convolution neural networks. In 2017 Chinese
Automation Congress (CAC), pages 4165–4170.
Sumari, A. (2009). A study on identification friend, foe, or
neutral methods: The performance of supervised and
unsupervised neural networks in performing aircraft
identification tasks. 5:10.
Takahashi, R., Matsubara, T., and Uehara, K. (2020). Data
augmentation using random image cropping and patch-
ing for deep cnns. IEEE Transactions on Circuits and
Systems for Video Technology, 30(9):2917–2931.
Trabucco, B., Doherty, K., Gurinas, M., and Salakhutdinov,
R. (2023). Effective data augmentation with diffusion
models.
Walawalkar, D., Shen, Z., Liu, Z., and Savvides, M. (2020).
Attentive cutmix: An enhanced data augmentation ap-
proach for deep learning based image classification.
CoRR, abs/2003.13048.
Wu, Z., Wang, L., Wang, W., Shi, T., Chen, C., Hao, A.,
and Li, S. (2022). Synthetic data supervised salient
object detection. In Proceedings of the 30th ACM
International Conference on Multimedia, MM ’22,
page 5557–5565, New York, NY, USA. Association
for Computing Machinery.
Xiao, X., Ganguli, S., and Pandey, V. (2020). Vae-info-cgan:
generating synthetic images by combining pixel-level
and feature-level geospatial conditional inputs. In Pro-
ceedings of the 13th ACM SIGSPATIAL International
Workshop on Computational Transportation Science,
IWCTS ’20, New York, NY, USA. Association for
Computing Machinery.
Zhang, L., Rao, A., and Agrawala, M. (2023). Adding con-
ditional control to text-to-image diffusion models. In
Proceedings of the IEEE/CVF International Confer-
ence on Computer Vision, pages 3836–3847.
VISAPP 2025 - 20th International Conference on Computer Vision Theory and Applications
796