Evaluation of Open-Source OCR Libraries for Scene Text Recognition in the Presence of Fisheye Distortion

María Flores, David Valiente, Marcos Alfaro, Marc Fabregat-Jaén, Luis Payá

2024

Abstract

Due to the rich and precise semantic information that text provides, scene text recognition is relevant in a wide range of vision-based applications. In recent years, the use of vision systems that combine a camera and a fisheye lens is common in a variety of applications. The addition of a fisheye lens has the great advantage of capturing a wider field of view, but this causes a great deal of distortion, making certain tasks challenging. In many applications, such as localization or mapping for a mobile robot, the algorithms work directly with fisheye images (i.e. distortion is not corrected). For this reason, the principal objective of this work is to study the effectiveness of some OCR (Optical Character Recognition) open-source libraries applied to images with fisheye distortion. Since no scene text dataset of this kind of image has been found, this work also generates a synthetic image dataset. A fisheye model which varies some parameters is applied to standard images of a benchmark scene text dataset to generate the proposed dataset.

Download


Paper Citation


in Harvard Style

Flores M., Valiente D., Alfaro M., Fabregat-Jaén M. and Payá L. (2024). Evaluation of Open-Source OCR Libraries for Scene Text Recognition in the Presence of Fisheye Distortion. In Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO; ISBN 978-989-758-717-7, SciTePress, pages 133-140. DOI: 10.5220/0012927600003822


in Bibtex Style

@conference{icinco24,
author={María Flores and David Valiente and Marcos Alfaro and Marc Fabregat-Jaén and Luis Payá},
title={Evaluation of Open-Source OCR Libraries for Scene Text Recognition in the Presence of Fisheye Distortion},
booktitle={Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO},
year={2024},
pages={133-140},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012927600003822},
isbn={978-989-758-717-7},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 21st International Conference on Informatics in Control, Automation and Robotics - Volume 2: ICINCO
TI - Evaluation of Open-Source OCR Libraries for Scene Text Recognition in the Presence of Fisheye Distortion
SN - 978-989-758-717-7
AU - Flores M.
AU - Valiente D.
AU - Alfaro M.
AU - Fabregat-Jaén M.
AU - Payá L.
PY - 2024
SP - 133
EP - 140
DO - 10.5220/0012927600003822
PB - SciTePress