word error detection; (b) isolated-word error
correction; and (c) context-dependent word
correction. However, many proposed techniques rely
on a lexicon-based approach.
The typical spelling correction system by
(Levenshtein, Vladimir I., 1966) is based on
minimum edit distance which ranks suggestion by
the least number of inclusion, removal, replacement
and reversal required to convert one string into the
other.
(Turchin, Alexander, et al, 2007) identified
incorrect words by comparing them to some
predefined list of words, but this baseline method is
extended by doing prevalence analysis, i.e.
determining the frequency ratio of a word and its
one edit distance alternatives in the corpus.
(Patrick, Jon, and Dung. 2011) used numerous
knowledge bases of English clinical terms in
addition to utilizing statistical methods.
Mass noun errors in English are solved by
(Brockett et al., 2006), who focused on grammatical
errors rather than on orthographical. Their work is
related to the (Ehsan, Nava, and Faili, 2013) where
the traditional SMT algorithm is used for spelling
error preciseness. Though, in the approach, errors
were initiated artificially.
(Siklósi et al., 2016) presented a new method for
automatic correction in Hungarian clinical records
by means of a SMT decoder. Due to the lack of a
corpus normalized, a realistic aim was not fully
achieved.
Some spelling suggestion tools such as Aspell
and Gspell also exist in the English language for use
and exploration. Aspell is a mixture of the Meta-
phone algorithm and near-miss strategy. While
NGrams, metaphone, common misspellings, and
homophone retrieval tools are present in Gspell.
Candidates are evaluated by the Levenshtein edit
distance, and similar ranked candidates are re-
ordered by (Divita, G., 2003).
A frequency-based approach joining a medical
dictionary configuration was built to improve
recommendations of Aspell and Gspell by (Crowell
et al., 2004). Turchin et al. used prevalence analysis
for correction. (Senger, Christian, et al., 2010) made
use of Aspell and user activities to analyze
medication misspellings in a drug query system.
6 CONCLUSIONS AND FUTURE
WORK
As we know that medical records play a significant
role in everyone’s life. So, the focus of this study is
to illustrate the problems of Electronic Medical
Records. Based on the causes of data errors, an
effective improvement to the EMR would be to
expand its scope to classify possible medications. In
our paper, we presented a method to correct single
spelling errors by concentrating more on ‘how’ and
‘why’ part of the searching instead of ‘what’,
making a firm base for extending it to the correction
of multiple errors as well. POST, Regular
Expressions, and Information Retrieval played an
important role in substitution. The overall accuracy
of the system is a technically better than traditional
techniques.
Even with EMR extensions, 100% accuracy
can’t be guaranteed, some minor error chances will
remain (Wagner, et al., 1996). As the domain of
searching is very gigantic; still more work is
required to gather more accurate and close results.
So, we have some future plans for including text
parsing in it and will do the implementation of this
technique for free-text clinical records to provide
more ease to practitioners.
REFERENCES
Brockett, C., Dolan, W.B. and Gamon, M., 2006, July.
Correcting ESL errors using phrasal SMT techniques.
In Proceedings of the 21st International Conference
on Computational Linguistics and the 44th annual
meeting of the Association for Computational
Linguistics (pp. 249-256). Association for
Computational Linguistics.
Crowell, J., Zeng, Q., Ngo, L., and Lacroix, E.M., 2004. A
frequency-based technique to improve the spelling
suggestion rank in medical queries. Journal of the
American Medical Informatics Association, 11(3),
pp.179-185.
DesRoches, C.M., Campbell, E.G., Rao, S.R., Donelan,
K., Ferris, T.G., Jha, A., Kaushal, R., Levy, D.E.,
Rosenbaum, S., Shields, A.E. and Blumenthal, D.,
2008. Electronic health records in ambulatory care—a
national survey of physicians. New England Journal of
Medicine, 359(1), pp.50-60.
Divita, G. 2003. Spelling Suggestion Tools (Gspell),
National Library of Medicine. Available at
http://lexsrv3.nlm.nih.gov/LexSysGroup/Projects/gSpe
ll/current/GSpell.html.
Ehsan, N., and Faili, H., 2013. Grammatical and
contextsensitive error correction using a statistical
machine translation framework. Software: Practice
and Experience, 43(2), pp.187-206.
Kukich, K., 1992. Techniques for automatically correcting
words in text.ACM Computing Surveys (CSUR), 24(4),
pp.377-439.
Levenshtein, V.I., 1966, February. Binary codes capable
Identification and Correction of Misspelled Drugs’ Names in Electronic Medical Records (EMR)