Deep Learning for Effective Classification and Information Extraction of Financial Documents
Valentin-Adrian Serbanescu, Maruf A. Dhali
2025
Abstract
The financial and accounting sectors are encountering increased demands to effectively manage large volumes of documents in today’s digital environment. Meeting this demand is crucial for accurate archiving, maintaining efficiency and competitiveness, and ensuring operational excellence in the industry. This study proposes and analyzes machine learning-based pipelines to effectively classify and extract information from scanned and photographed financial documents, such as invoices, receipts, bank statements, etc. It also addresses the challenges associated with financial document processing using deep learning techniques. This research explores several models, including LeNet5, VGG19, and MobileNetV2 for document classification and RoBERTa, LayoutLMv3, and GraphDoc for information extraction. The models are trained and tested on financial documents from previously available benchmark datasets and a new dataset with financial documents in Romanian. Results show MobileNetV2 excels in classification tasks (with accuracies of 99.24% with data augmentation and 93.33% without augmentation), while RoBERTa and LayoutLMv3 lead in extraction tasks (with F1-scores of 0.7761 and 0.7426, respectively). Despite the challenges posed by the imbalanced dataset and cross-language documents, the proposed pipeline shows potential for automating the processing of financial documents in the relevant sectors.
DownloadPaper Citation
in Harvard Style
Serbanescu V. and Dhali M. (2025). Deep Learning for Effective Classification and Information Extraction of Financial Documents. In Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM; ISBN 978-989-758-730-6, SciTePress, pages 749-756. DOI: 10.5220/0013261000003905
in Bibtex Style
@conference{icpram25,
author={Valentin-Adrian Serbanescu and Maruf Dhali},
title={Deep Learning for Effective Classification and Information Extraction of Financial Documents},
booktitle={Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2025},
pages={749-756},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013261000003905},
isbn={978-989-758-730-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Deep Learning for Effective Classification and Information Extraction of Financial Documents
SN - 978-989-758-730-6
AU - Serbanescu V.
AU - Dhali M.
PY - 2025
SP - 749
EP - 756
DO - 10.5220/0013261000003905
PB - SciTePress