optimal solution than the previous researches.
Bahasa was selected as the focus of this research
because many existing researches used foreign
languages such as English, Swedish, etc., as its main
focus. Lack of knowledge of the Indonesian people
about good and proper grammar is the other reason.
It caused slang or colloquial language which is often
unconsciously used in the official documents that
should have used the proper language.
This research is expected to help Indonesian
people in the process of typing a document more
quickly and precisely based on the proper
Indonesian language.
2 RELATED WORKS
Previous research that discuss about measuring
performance of predictive text system was a research
about keystrokes per character (KSPC) and
keystroke saving (MacKenzie, 2002). The result of
the research said that the smaller value of KSPC will
give a better performance for the system. A year
later, there was another research to evaluate the
accurate measurement of predictive system in case
of typing errors caused by users (Soukoreff and
Mackenzie, 2003). This evaluation was done by
minimum string distance (MSD) error rate and
KSPC. The research results were a new equation for
MSD error rate and development of KSPC formula.
Using this development, the used bandwidth which
represents useful information that was transferred
will be determined. Besides, it could determine the
wasted bandwidth and the total error rate.
The next research was a survey that revealed
several factors associated with the predictive text
system (Vitoria and Abascal, 2006). The research
stated that there are eight important factors that
affect a predictive text system. They were size of the
text block, dictionary structure, prediction method,
effect of the language used, the system interface,
system adaptability, system usability, and other
special features. The result stated that there are five
prediction methods that can be used in predictive
text. They are prediction using frequencies,
prediction using word probability tables, syntactic
prediction using probability tables, syntactic
prediction using grammar, and semantic prediction.
The survey also concluded that the result of a
predictive text system was expressed in terms of
keystroke saving and hit ratio or predictive accuracy
of a system can be considered as another measure
tools of predictive text system.
In 2008, there was a research that found a
standard of keystroke saving in evaluating a word
prediction system (Trnka and McCoy, 2008). The
result of this research stated that there are two limits
or boundaries that can become a standard evaluation
of a word prediction system. The two limits are
theoretical keystroke saving limit and vocabulary
limit.
Furthermore, there was a new development of
the predictive text system by incorporating some
prediction methods, such as using the rules of
English grammar to help text prediction and by
adapting to the amount of word usage frequency
(Nalavade, Mahule and Ketkar, 2008). The research
result declares the incorporation of these methods
can reduce KSPC by 26.91% compared to the T9
predictive text system.
The combination of semantic methods,
frequency, and part-of-speech model on keypads
was used in the next research (Gong, Tarasewich,
and MacKenzie, 2008). The result showed that it can
improve the text entry speed by 10% and reduce
errors as much as 20% depending on the keypads. A
year later, subsequent research did a combination of
syntactic and semantic method (Ganslandt, Jorwall,
and Nugues, 2009). The result declared that it can
reduce KSPC error in the Sweden corpus as much as
12.4%. In addition, when the combination of
syntactic and semantic coupled with the bigram
method, it can reduce the error up to 29.4%.
The next research was about a predictive text
system based on n-gram method (Verberne, et.al,
2012). N-gram was known as buffer and there are
two forms of buffer types (n-gram) which are
'current prefix of the word' and 'buffer15'. The
'buffer15' gave a better result than 'prefix of the
current word'. The summary of the combination of
predictive methods can be seen in Table 1.
3 PROPOSED ALGORITHM
The purpose of this research is to develop predictive
text system by combining some prediction methods
that hopefully can give smaller KSPC value than
previous researches. Methods that are used in this
research are:
3.1 Frequency
Frequency method is used to rank words in the word
table. It is based on how many times the word were
typed by the user. This method works by adding the
value of used word incrementally. By using this
method, predictive text system will offer words that
ICAART2014-InternationalConferenceonAgentsandArtificialIntelligence
306