The Effect of Font Variation in the Accuracy of Image to Text
Conversion
Vera Firmansyah and Amalia Rakhmawati
Academy of Metrology and Instrumentation, Ministry of Trade, Bandung, Indonesia
Keywords: Font, OCR, OpenCV, Accuracy.
Abstract: Image processing is using both hardware and software as tools to analyze and as an interface to process an
image. These tools are able to improve the welfare and the quality of life of people with visual impairments
to help them to read articles. The level of impaired vision can vary from person to person. Thus, this research
develops initiate step in image to text conversion with font variation. Image to text conversion is done by
extracting text from images obtained through the camera from the article. Previous research used
microprocessor equipped with a camera module and the Tesseract OCR in the Python Pence. The Tesseract
OCR program on Pence is an open source program used to extract text from images and save it in the form of
a text. This research using 5 font variation chosen randomly which are Times New Roman, Arial, Calibri,
Comic Sans and Courier New. Image Processing using Dilation, Crop, Canny and Median Blur. The result
shows that Comic Sans Font has the highest accuracy and Times New Roman has the lowest accuracy. Comic
Sans has the highest accuracy because the overall font does not have much curves than the other while Times
New Roman Font has the lowest accuracy because it has more curve characteristics.
1 INTRODUCTION
Image processing is using both hardware and
software as tools to analyze and as an interface to
process an image. In 2015 it is estimated that of the
7.33 trillion world population, there are 253 million
people (3.38%) who suffer from visual disturbances,
consisting of 36 million people experiencing
blindness, 217 million experiencing moderate to
severe visual impairment. In addition, there are 188
million people with mild visual disturbances (M. Patil
and R. Kagalkar, 2014).
The classification of visual
impairments used is in accordance with the WHO
classification, which is based on visual acuity
(Ministry of Health of the Republic of Indonesia,
2018).
Therefore, those tools are able to improve the
welfare and the quality of life of people with visual
impairments to help them to read articles. The level
of impaired vision can vary from person to person.
So this research develops initiate step in image to
text conversion with font variation. Image to text
conversion is done by extracting text from images
obtained through the camera from the article.
Previous research used microprocessor equipped
with a camera module and the Tesseract OCR
(Optical Character Recognition) program in the
Python OpenCV (Open Computer Vision)
programming (Rithika, H., B. N. Santhoshi, 2016
).
OpenCV is an API (Application Programming
Interface) library used because it has familiarity
with computer vision image processing. Computer
vision is a branch of image processing field which
allows computers to see like humans. With
computer vision, the computer can make decisions,
take action, and recognize objects. Some of the
implementations of computer vision are face
recognition, face detection, face / project tracking,
road tracking, etc. (Widja. I. B. P., 2017). The
Tesseract OCR program on OpenCV is an open
source program used to extract text from images and
save it in the form of a text. Fig. 1 shows the
Tesseract OCR program on OpenCV to convert
image to text.
This research aims to get the effect of font
variation in the accuracy of image to text
conversion. The fonts are chosen randomly which
are Times New Roman, Arial, Calibri, Comic Sans
and Courier New.