Fast Scene Text Detection with RT-LoG Operator and CNN

Dinh Cong Nguyen, Dinh Cong Nguyen, Mathieu Delalandre, Donatello Conte, The Anh Pham

2020

Abstract

Text detection in scene images is of particular importance for the computer-based applications. The text detection methods must be robust against variabilities and deformations of text entities. In addition, to be embedded into mobile devices, the methods have to be time efficient. In this paper, the keypoint grouping method is proposed by first applying the real-time Laplacian of Gaussian operator (RT-LoG) to detect keypoints. These keypoints will be grouped to produce the character patterns. The patterns will be filtered out by using a CNN model before aggregating into words. Performance evaluation is discussed on the ICDAR2017 RRC-MLT and the Challenge 4 of ICDAR2015 datasets. The results are given in terms of detection accuracy and time processing against different end-to-end systems in the literature. Our system performs as one of the strongest detection accuracy while supporting at approximately 15.6 frames per second to the HD resolution on a regular CPU architecture. It is one of the best candidates to guarantee the trade-off between accuracy and speed in the literature.

Download


Paper Citation


in Harvard Style

Nguyen D., Delalandre M., Conte D. and Pham T. (2020). Fast Scene Text Detection with RT-LoG Operator and CNN. In Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 4: VISAPP; ISBN 978-989-758-402-2, SciTePress, pages 237-245. DOI: 10.5220/0008944502370245


in Bibtex Style

@conference{visapp20,
author={Dinh Cong Nguyen and Mathieu Delalandre and Donatello Conte and The Anh Pham},
title={Fast Scene Text Detection with RT-LoG Operator and CNN},
booktitle={Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 4: VISAPP},
year={2020},
pages={237-245},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008944502370245},
isbn={978-989-758-402-2},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 15th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2020) - Volume 4: VISAPP
TI - Fast Scene Text Detection with RT-LoG Operator and CNN
SN - 978-989-758-402-2
AU - Nguyen D.
AU - Delalandre M.
AU - Conte D.
AU - Pham T.
PY - 2020
SP - 237
EP - 245
DO - 10.5220/0008944502370245
PB - SciTePress