Deep Learning for Effective Classification and Information Extraction of Financial Documents

Valentin-Adrian Serbanescu, Maruf A. Dhali

2025

Abstract

The financial and accounting sectors are encountering increased demands to effectively manage large volumes of documents in today’s digital environment. Meeting this demand is crucial for accurate archiving, maintaining efficiency and competitiveness, and ensuring operational excellence in the industry. This study proposes and analyzes machine learning-based pipelines to effectively classify and extract information from scanned and photographed financial documents, such as invoices, receipts, bank statements, etc. It also addresses the challenges associated with financial document processing using deep learning techniques. This research explores several models, including LeNet5, VGG19, and MobileNetV2 for document classification and RoBERTa, LayoutLMv3, and GraphDoc for information extraction. The models are trained and tested on financial documents from previously available benchmark datasets and a new dataset with financial documents in Romanian. Results show MobileNetV2 excels in classification tasks (with accuracies of 99.24% with data augmentation and 93.33% without augmentation), while RoBERTa and LayoutLMv3 lead in extraction tasks (with F1-scores of 0.7761 and 0.7426, respectively). Despite the challenges posed by the imbalanced dataset and cross-language documents, the proposed pipeline shows potential for automating the processing of financial documents in the relevant sectors.

Download


Paper Citation


in Harvard Style

Serbanescu V. and Dhali M. (2025). Deep Learning for Effective Classification and Information Extraction of Financial Documents. In Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM; ISBN 978-989-758-730-6, SciTePress, pages 749-756. DOI: 10.5220/0013261000003905


in Bibtex Style

@conference{icpram25,
author={Valentin-Adrian Serbanescu and Maruf Dhali},
title={Deep Learning for Effective Classification and Information Extraction of Financial Documents},
booktitle={Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2025},
pages={749-756},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013261000003905},
isbn={978-989-758-730-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - Deep Learning for Effective Classification and Information Extraction of Financial Documents
SN - 978-989-758-730-6
AU - Serbanescu V.
AU - Dhali M.
PY - 2025
SP - 749
EP - 756
DO - 10.5220/0013261000003905
PB - SciTePress