
by disables evidencing severe mental impairments.
With respect to the size of the symbol table linear
scansion, row-column scansion and scansion at sub-
groups do not offer good performance if the number
of symbols is elevate (50 symbols and more). Hence,
current non predictive scansion strategies do not allow
to verbal disables suffering motor disorders a relevant
reduction in the time spent to compose sentences (Lee
et al., 2001).
Although predictive scansion strategies could as-
sure better performance (Koester and Levine, 1994),
they are currently adopted in a small number of AAC
assistitive technology aids. In particular symbolic pre-
diction is currently adopted only in VOCAs, but it evi-
dences severe limitations: it predicts symbols accord-
ing to a strict set of sentences previously registered
and do not exploit a linguistic behavior model of the
user. In such a way such prediction system allows the
user to compose a fixed number of messages and it
is not able to generalize allowing the composition of
new messages (Higginbotham et al., 1998). In litera-
ture numerous predictive techniques have been devel-
oped, but they have been applied mainly in alphabet-
ical prediction. In such context the main prediction
techniques (Aliprandi et al., 2003) employ a statisti-
cal approach (based on hidden Markov models and
Bayesian networks), a syntactic and strong syntac-
tic approach (based on linguistic models), a semantic
approach (based on semantic networks), and hybrid
approaches. To the best of our knowledge, currently
symbolic predictive models do not exist.
The main issue with alphabetical predictive tech-
niques, that prevents their use for symbolic predic-
tion, is related to the size of the dictionary of items
to be predicted and their composition rules. In fact,
alphabetical prediction operates on a limited num-
ber (about 20) of items, the alphabetic signs, that
can be organized in words known a priori. Con-
versely, symbolic prediction operates on a set of sym-
bols variable in number that can be organized in dif-
ferent sequences according to the peculiar user lin-
guistic capabilities. In addition alphabetical predic-
tion techniques do not match with the symbolic pre-
diction issue. On the other side, a pure statistical ap-
proach does not keep into account the peculiar AAC
language structure, in fact each verbal impaired user
adopts/develops an own syntactic model according to
his/her residual mental capacities. This is also the rea-
son for which the utilization of a pure syntactic ap-
proach for any user can not be achieved, and a pure
semantic approach does not address the variability re-
lated to the residual user capacities.
We consider an ad-hoc hybrid approach as the right
choice in this context; in the following sections of the
paper we focus on the description of this prediction
model since it represents the most original part of our
work.
3 BLISS SYMBOLS PREDICTION
AND GRAPHICAL MODELS
In our work we focus on the Bliss language (Bliss,
1966), since it is the most adopted and expressive
among AAC languages. In the design of a composi-
tion assistant to predicts Bliss symbols, a set of pecu-
liar requirements regarding both the human-computer
interface and the prediction model can be established.
The composition assistant should suggest a lim-
ited number of symbols (around 4-5) not to confuse
the disable user (Koester and Levine, 1994), the pre-
diction must be accomplished in real time, and the
scansion rate must be adaptable with the user needs
(Cronk and Schubert, 1987). This last aspect ad-
dresses issues due to the high variability of residual
mental and motor capabilities, in fact the composition
assistant should be able to adapt the scansion rate ac-
cording to the time required by the specific disable
to read and select the highlighted symbol. With re-
spect to the prediction model, a verbal impaired user
can adopt all the Bliss symbols (about 2000), even
if he/she usually adopts only a part of them (usually
from 6-7 to 200), and it should be taken into account
that the symbol to be predicted depends in some ex-
tents on the symbols selected previously.
We have adopted a semantic/probabilistic approach
to model the user language behavior and we use this
model in order to predict the most probable symbols
to be suggested by an automatic scansion system. We
have used the semantic approach to take advantage of
a Bliss symbols categorization and the probabilistic
approach both to take into account for uncertainties
in the user language model and to give a reasonable
estimate of the reliability of the proposed prediction.
In CABA
2
L we have used a graphical model based
on a variation of a classical Hidden Markov Models
(HMM). Classical HMMs involve states and symbols,
in particular they relate the probability that a particu-
lar symbol is emitted to the probability that the system
is in particular state. Moreover they use a stochas-
tic process to define the transition from a state to the
other (see Figure 2).
In HMM a particular sequence of observation (i.e.
observed symbols) is generated by choosing at time
t = 0 the initial state s
i
∈ S according to an ini-
tial probability distribution π(0), a symbol v
k
is gen-
erated from a multinomial probability distribution b
i
k
associated to state s
i
, and the system move from the
present state s
i
to the next state s
i
0
of the sequence
according to a transition probability a
ii
0
to generate
the next symbol. States in this model are not directly
observable; symbols represent the only information
that can be observed, and this is the reason for the
term hidden in the model name. Notice that classical
HMMs consider symbols as independent from each
CABA2L A BLISS PREDICTIVE COMPOSITION ASSISTANT FOR AAC COMMUNICATION SOFTWARE
91