
4 CONCLUSION
In conclusion, this paper introduced a well-rounded
approach toward automated military badge detection
on government documents by utilizing the YOLOv5
model. By innovatively automating the training data
labelling process and generating a simulated dataset
of military and official documents, circumventing the
issue of public unavailability, a scalable and precise
badge detection system was established. Through
strategic training and hyper-parameter tuning, the
YOLOv5 model showcased substantial proficiency in
detecting various badge types within the documents,
illustrating a promising stride in document-based ob-
ject detection.
REFERENCES
Arvind, N. (2023). A semi-automatic method for document
classification in the shipping industry. In Proceedings
of Neptune’s conference, Samudramanthan 2023 IIT
Kharagpur.
Bailey, E. S., Bonnici, A., and Cristina, S. (2022). A cas-
caded approach for page-object detection in scientific
papers. In Proceedings of the 22nd ACM Symposium
on Document Engineering, DocEng ’22, New York,
NY, USA. Association for Computing Machinery.
Brown, J. D. (2010). Developing an automatic document
classification system: A review of current literature
and future directions. Technical Memorandum DRDC
Ottawa TM 2009-269, Defence Research and Devel-
opment Canada.
Chiu, P., Chen, F., and Denoue, L. (2010). Picture detection
in document page images.
Ciecierski, K. and Kamola, M. (2020). Comparison of Text
Classification Methods for Government Documents,
pages 39–49.
Deng, Q., Ibrayim, M., Hamdulla, A., and Zhang, C.
(2023). The yolo model that still excels in document
layout analysis. Preprint under review at Signal, Im-
age and Video Processing as of August 2023.
Forczma
´
nski, P., Smolinski, A., Nowosielski, A., and
Małecki, K. (2020). Segmentation of Scanned Doc-
uments Using Deep-Learning Approach, pages 141–
152.
Henderson, P. and Ferrari, V. (2017). End-to-end training
of object class detectors for mean average precision.
In Lai, S.-H., Lepetit, V., Nishino, K., and Sato, Y.,
editors, Computer Vision – ACCV 2016, pages 198–
213, Cham. Springer International Publishing.
Huber-fliflet, N., Wei, F., Zhao, H., Qin, H., Ye, S., and
Tsang, A. (2019). Image analytics for legal document
review : A transfer learning approach. In 2019 IEEE
International Conference on Big Data (Big Data),
pages 4325–4328.
Kallempudi, G., Hashmi, K. A., Pagani, A., Liwicki, M.,
Stricker, D., and Afzal, M. Z. (2022). Toward semi-
supervised graphical object detection in document im-
ages. Future Internet, 14(6).
Li, M. and Ghosal, S. (2014). Bayesian Multiscale Smooth-
ing of Gaussian Noised Images. Bayesian Analysis,
9(3):733 – 758.
Liu, C., Tao, Y., Liang, J., Li, K., and Chen, Y. (2018). Ob-
ject detection based on yolo network. In 2018 IEEE
4th Information Technology and Mechatronics Engi-
neering Conference (ITOEC), pages 799–803.
Orosz, T., V
´
agi, R., Cs
´
anyi, G. M., Nagy, D.,
¨
Uveges, I.,
Vad
´
asz, J. P., and Megyeri, A. (2022). Evaluating hu-
man versus machine learning performance in a legal-
tech problem. Applied Sciences, 12(1).
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
(2016). You only look once: Unified, real-time ob-
ject detection. In 2016 IEEE Conference on Computer
Vision and Pattern Recognition (CVPR), pages 779–
788.
Rezkiani, K., Nurtanio, I., and Syafaruddin (2022). Logo
detection using you only look once (yolo) method.
In 2022 2nd International Conference on Electronic
and Electrical Engineering and Intelligent System
(ICE3IS), pages 29–33.
Sarwo, Heryadi, Y., Abdulrachman, E., and Budiharto, W.
(2019). Logo detection and brand recognition with
one-stage logo detection framework and simplified
resnet50 backbone. In 2019 International Congress
on Applied Information Technology (AIT), pages 1–6.
Song, Y., Li, Z., He, J., Li, Z., Fang, X., and Chen,
D. (2019). Employing auto-annotated data for gov-
ernment document classification. In Proceedings of
the 2019 3rd International Conference on Innovation
in Artificial Intelligence, ICIAI ’19, pages 121–125,
New York, NY, USA. Association for Computing Ma-
chinery.
Suto, J. (2023). Improving the generalization capability of
yolov5 on remote sensed insect trap images with data
augmentation. Multimedia Tools and Applications.
Zou, Z., Chen, K., Shi, Z., Guo, Y., and Ye, J. (2023). Ob-
ject detection in 20 years: A survey. Proceedings of
the IEEE, 111(3):257–276.
Military Badge Detection and Classification Algorithm for Automatic Processing of Documents
731