(2017). Single shot text detector with regional attention.
In 2017 IEEE International Conference on Computer Vi-
sion (ICCV), pages 3066–3074.
He, T., Huang, W., Qiao, Y., and Yao, J. (2016). Text-
attentional convolutional neural network for scene text
detection. IEEE Transactions on Image Processing,
25(6):2529–2541.
Howard, A. G., Zhu, M., Chen, B., Kalenichenko, D.,
Wang, W., Weyand, T., Andreetto, M., and Adam,
H. (2017). Mobilenets: Efficient convolutional neu-
ral networks for mobile vision applications. CoRR,
abs/1704.04861.
Karatzas, D., Gomez-Bigorda, L., Nicolaou, A., Ghosh, S.,
Bagdanov, A., Iwamura, M., Matas, J., Neumann, L.,
Chandrasekhar, V. R., Lu, S., Shafait, F., Uchida, S., and
Valveny, E. (2015). Icdar 2015 competition on robust
reading. In 2015 13th International Conference on Doc-
ument Analysis and Recognition (ICDAR), pages 1156–
1160.
Karatzas, D., Mestre, S. R., Mas, J., Nourbakhsh, F., and
Roy, P. P. (2011). Icdar 2011 robust reading competition
- challenge 1: Reading text in born-digital images (web
and email). In 2011 International Conference on Docu-
ment Analysis and Recognition, pages 1485–1490.
Karatzas, D., Shafait, F., Uchida, S., Iwamura, M., Big-
orda, L. G. i., Mestre, S. R., Mas, J., Mota, D. F., Al-
maz
`
an, J. A., and de las Heras, L. P. (2013). Icdar
2013 robust reading competition. In Proceedings of the
2013 12th International Conference on Document Anal-
ysis and Recognition, ICDAR ’13, pages 1484–1493,
Washington, DC, USA.
Liao, M., Shi, B., and Bai, X. (2018). Textboxes++: A
single-shot oriented scene text detector. IEEE Transac-
tions on Image Processing, 27(8):3676–3690.
Liao, M., Shi, B., Bai, X., Wang, X., and Liu, W. (2017).
Textboxes: A fast text detector with a single deep neural
network. In Proceedings of the Thirty-First AAAI Confer-
ence on Artificial Intelligence, February 4-9, 2017, San
Francisco, California, USA., pages 4161–4167.
Lin, T., Doll
´
ar, P., Girshick, R., He, K., Hariharan, B., and
Belongie, S. (2017). Feature pyramid networks for object
detection. In 2017 IEEE Conference on Computer Vision
and Pattern Recognition (CVPR), pages 936–944.
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu,
C.-Y., and Berg, A. C. (2016). SSD: Single shot multibox
detector. In Leibe, B., Matas, J., Sebe, N., and Welling,
M., editors, Computer Vision – ECCV 2016, pages 21–
37, Cham. Springer International Publishing.
Neubeck, A. and Gool, L. V. (2006). Efficient non-
maximum suppression. In 18th International Conference
on Pattern Recognition (ICPR’06), volume 3, pages 850–
855.
Redmon, J. and Farhadi, A. (2018). Yolov3: An incremental
improvement. CoRR, abs/1804.02767.
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and
Chen, L. (2018). Mobilenetv2: Inverted residuals and
linear bottlenecks. In 2018 IEEE/CVF Conference on
Computer Vision and Pattern Recognition, pages 4510–
4520.
Shi, B., Bai, X., and Yao, C. (2017). An end-to-end train-
able neural network for image-based sequence recogni-
tion and its application to scene text recognition. IEEE
Transactions on Pattern Analysis and Machine Intelli-
gence, 39(11):2298–2304.
Tang, Y. and Wu, X. (2017). Scene text detection
and segmentation based on cascaded convolution neu-
ral networks. IEEE Transactions on Image Processing,
26(3):1509–1520.
Tieleman, T. and Hinton, G. (2012). Lecture 6.5-rmsprop:
Divide the gradient by a running average of its recent
magnitude. COURSERA: Neural networks for machine
learning, 4(2):26–31.
Wang, L., Wang, Z., Qiao, Y., and Van Gool, L. (2018).
Transferring deep object and scene representations for
event recognition in still images. International Journal
of Computer Vision, 126(2):390–409.
Wu, B., Iandola, F., Jin, P. H., and Keutzer, K. (2017).
Squeezedet: Unified, small, low power fully convo-
lutional neural networks for real-time object detection
for autonomous driving. In 2017 IEEE Conference on
Computer Vision and Pattern Recognition Workshops
(CVPRW), pages 446–454.
Yan, C., Xie, H., Liu, S., Yin, J., Zhang, Y., and Dai,
Q. (2018). Effective uyghur language text detection in
complex background images for traffic prompt identifi-
cation. IEEE Trans. Intelligent Transportation Systems,
19(1):220–229.
Ye, Q. and Doermann, D. S. (2015). Text detection and
recognition in imagery: A survey. IEEE Trans. Pattern
Anal. Mach. Intell., 37(7):1480–1500.
Yi, C., Tian, Y., and Arditi, A. (2014). Portable camera-
based assistive text and product label reading from hand-
held objects for blind persons. IEEE/ASME Transactions
on Mechatronics, 19(3):808–817.
Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., and Bai,
X. (2016). Multi-oriented text detection with fully con-
volutional networks. In The IEEE Conference on Com-
puter Vision and Pattern Recognition (CVPR).
Zhu, Y., Liao, M., Yang, M., and Liu, W. (2018). Cascaded
segmentation-detection networks for text-based traffic
sign detection. IEEE Transactions on Intelligent Trans-
portation Systems, 19(1):209–219.
VISAPP 2020 - 15th International Conference on Computer Vision Theory and Applications
350