Blood Sediment Rate (LED) is a measure of
erythrocyte sedimentary velocity describing plasma
composition as well as erythrocyte and plasma
comparisons. LEDs are affected by the weight of
blood cells and cell surface area and the earth’s
gravity.
Platelets are the smallest element in the blood
vessels. Platelets are activated after contact with the
surface of the endothelial wall.
3 PREVIOUS RESEARCH
Eugenio et al. (2014) performed a research on the
creation of a system that could produce summary text
from physician doctors’ briefed notes and nurse
structural documentation (containing patient care
plans) for patients with inpatient heart disease. This
summary text is useful for helping patients to take
care of themselves after their hospitalization and as
an ap-proach to educate patients about what
treatments are being performed to patients during the
inpatient process.
Archarya et al. (2016) create a system for gener
ating summary text of patient hospital data by com-
bining information from two heterogeneous sources of
doctors and nurses documentation. Their study fo-
cuses on producing summary text taking into consid-
eration the complexities of medical terms. The first
step is to extract written content of the medical docu-
ment from the mix of both sources, and then the con-
tent is identified to determine if there are any terms
that belong to simple (unexplainable) terms or com-
plex (terms that need explanation) using metrics cre-
ated.
Another research by Mahamood and Reiter
(2011) focused on the effective approach of creating
a system that generates a text of medical information
reports for parents of premature babies. They analyze
the signal and interpret EMR data to identify the
important events and the relationship between the
events occur-ring in the EMR data. Then use the
NLG method to convert the EMR data into a
narrative text. Their research focused on the text
produced by the system that could be understood by
people who are not professionals in the medical field
and the resulting report text only gives positive
information about infant development.
The difference of this research with the previous
research works is that in this study we implement
Natural Language Generation to interpret the results
of hematological examination of patients into the
form of summary text using Template Generation
System (TGen-System). TGen System generates the
template candidates (i.e sentences with related slots)
automatically which has been classified by
considering the content sentences.
4 METHODOLOGY
In this research we implement NLG template-based
to interpret the data of Complete Blood Count (CBC)
into the Indonesian textual representation. The sys-
tem, called Complete Blood Count Interpreter System
(CBCI-System), employs Natural Language Genera-
tion (NLG) concept in generating Indonesian textual
representation. The textual representation is deployed
by filling related data into the appropriate template
slots. Furthermore to handle the limitation of
traditional template-based approach in term of text
diversity and maintainability, we propose Template
Generation System (TGen System). TGen System
generates template candidates that has been classified
based on content of the sentences. This system helps
CBCI System to produce the textual report of CBC
result which is not only varied but also easier to
understand. The proposed architecture of TGen
System is presented in Figure 1.
As shown in Figure 1, TGen System generates the
list of sentence templates based on the related corpus
(i.e. corpus existing text interpretation of CBC)
through Text Segmentation, Slot Generation, Simi lar
Template Removing, and Template Classification.
During the process of TGen system, it requires
linguistics knowledge, which is obtained from the
hematology experts.
Since the related corpus is the textual report
examples of CBC result, the first process of TGen
System is Text Segmentation. Conceptually, Text
Segmentation works on the sentence level, then it is
used to split every sentence contained in corpus based
on the newline and end of character. After sentences
are segmented by Text Segmentation, the system will
decide words or phrases, which are the related slot
candidates. The output of Slot Generation can be
called as the template candidates. Since Slot
Generation may generate the same templates, Similar
template Removing is responsible to collect one
template called as unique sentence template.
Furthermore unique sentence template is classified
into three content sentences (such as the opening
sentence, general description sentence, and detail
description sentence) by using linguistics knowledge.
Finally, Output of Template Classification will be
template in the interpretation of generated CBC result.