
selection menu, light to select the light, and switch on
to switch it on, or, if the light menu was the last one
selected, a sequence of length 1: switch on. We call
this menu arrangement hierarchical.
5.2 Connected Menus
In order to support direct switching between menus,
we include each device identifier into each of the
device menus, thus eliminating the selection menu.
Switching on the light requires either a command se-
quence of length 2: light and switch on, or of length 1:
switch on, depending on the recently selected menu.
We call this arrangement connected. Compared to the
hierarchical one, the connected arrangement requires
fewer steps on larger subvocabularies.
5.3 Command Sequence Behaviour
The error rates defined so far apply to single-pattern
reception. Now we define a command sequence fail-
ure rate. A command sequence is considered success-
ful, if each command is recognized correctly, other-
wise it is considered failed. For the purpose of sim-
plicity, we assume that command sequences are not
interrupted by garbage reception. Consequently, the
command sequence failure rate below gives a lower
bound for the actual figure, and the false acceptance
rate has no impact on the command sequence perfor-
mance and is kept as an extra figure. Assuming in-
dependent recognition errors, the failure rate F
L
of a
command sequence of length L is given by
F
L
= 1 −
L
Y
i=1
³
1 − (C
S
i
N
i
+ R
S
i
N
i
)
´
, (10)
where N
i
and S
i
are the recognizer’s vocabulary and
subvocabulary size, respectively, in the user interface
state corresponding to the reception of the i-th com-
mand. Equation (10) holds for all menu arrangements
and all subvocabulary types.
6 BASIC INTERFACE DESIGNS
Combining the alternatives given in Sections 4 and 5,
we identify four basic user interface designs:
design HE: hierarchical menu arrangement, em-
bedded subvocabularies,
design HS: hierarchical menu arrangement, sepa-
rated subvocabularies,
design CE: connected menu arrangement, embed-
ded subvocabularies,
design CS: connected menu arrangement, separated
subvocabularies.
In Section 7, we will evaluate, for three example
systems, these four designs, in order to point out the
advantages and drawbacks of each approach under the
conditions stated.
The evaluation of a design consists of four steps:
step 1: estimate the vocabulary and subvocabulary
sizes corresponding to each menu,
step 2: estimate the necessary speech recognizer
error rates via testing or by using Equations (1) to (3)
with appropriate test figures as inputs,
step 3: calculate the observable error rates for each
menu, using Equations (4) to (6) or (7) to (9),
step 4: for each command sequence type, calculate
the failure rate according to Equation (10).
7 EXAMPLE SYSTEMS
7.1 Example Speech Recognizer
Suppose a speech recognizer with the error rates:
c
10
= 0.005, r
10
= 0.03, and a
10
= 0.20 for a vocab-
ulary of size 10, on which the voice control systems
of Sections 7.2 to 7.4 shall be based. We do not con-
sider modifications of the recognizer itself. Instead,
we ask for the optimum user interface design for the
respective task, given the recognizer as it is.
7.2 Light, Heater, and Telephone
Control
Our first example system shall control a light with
the commands switch on, switch off, brighter, and
darker, a heater with switch on, switch off, warmer,
and cooler, and a hands-free telephone with connect,
disconnect, louder, and softer. The device selectors
are light, heater, and telephone. For the hierarchi-
cal arrangement, select shall be used to enter the de-
vice selection menu. Note that each device has the
same number of control commands, which facilitates
our analysis; in real world systems, different devices
will, in general, have control command sets of differ-
ent sizes. We analyze the system following Section 6,
using Equations (1) to (3) together with the recognizer
figures of Section 7.1 in step 2. The results are shown
in Table 1; there, the select-and-control failure rate
corresponds to command sequences required to select
a device and invoke a control action on it, whereas the
control failure rate corresponds to commands invok-
ing a control action on an already selected device.
In this example, the lowest select-and-control fail-
ure rate is achieved by employing the connected menu
arrangement together with separated subvocabularies;
however, it is paid with the highest false acceptance
rate.
ICEIS 2004 - HUMAN-COMPUTER INTERACTION
24