2 HUMAN ATTENTION
Attention is the ability to selectively focus on certain
parts of one’s surroundings, while disregarding the
other parts. Attention has often been compared to a
spotlight, which selectively illuminates objects in a
dark room.
Human attention has been extensively studied by
cognitive psychologists, and there’s a wealth of liter-
ature available on the issue (Chun and Wolfe, 2001).
There are two prevailing schools of thought within
the literature: Filter or attenuation theories, as propa-
gated by Broadbent (Broadbent, 1958) and Treisman
(Treisman and Gelade, 1980), assume that attention
works like a filter. Unneeded perceptions are either
removed or toned down, and do not enter conscious-
ness.
Resource models, on the other hand, propose that
attention is created by the distribution of limited at-
tentional resources (Cohen, 2003). Those process-
ing resources can be allocated to different percep-
tions, which allows them to be consciously perceived.
Those perceptions for which no resources are avail-
able will be discarded.
Both models can be used to explain the results of
psychological experiments (Cohen, 2003). We will
primarily use the resource model, since it makes it
easy to describe attention in computational terms.
Cognitive psychology has revealed many more
mechanisms of attention (Chun and Wolfe, 2001):
• The spotlight of attention can be divided, multiple
objects can be attract attention at the same time.
However, the overall performance always remains
the same.
• Attention can be shifted through a conscious effort.
However, it can also be drawn by certain features of
the environment. For example, a blinking light will
immediately draw a person’s attention. This kind
of attention shift occurs automatically and requires
no conscious effort. This property of attention is
exploited in image processing algorithms which at-
tempt to imitate the visual attention, for an example
see (Backer and Mertsching, 2003).
• While attention has spatial properties, it can also
work on whole objects. This indicates that objects
can be identified in a preattentive processing stage.
3 A MODEL OF ATTENTION
The attention model developed for the hearing aid as-
sumes that the user’s attention can be directed at a
number of possible targets. Each of these targets is a
distinct entity corresponding to an object in the real
world. For example, a speaker in a room would be a
possible target for the user’s attention.
Each target is attributed with a target description.
The descriptions contains the raw sensor data from
that target, and may also contain semantic informa-
tion that can be used for estimating the user’s atten-
tional focus. A target description for a speaker may
consist of the raw audio data from this speaker and
the speaker’s position relative to the user.
The attentional state of the user is the distribution
of the user’s attention over the existing targets. The
distribution is expressed, for each target, as the prob-
ability that the target is the user’s primary focus of
attention. This model is consistent with the psycho-
logical results which indicate that attention is directed
at objects, rather than abstract features.
For estimating the attentional state the possible tar-
gets have to been detected in the sensor information,
and each target’s sensor data is extracted separately.
This may seem like an excessive burden on the pre-
processing stage. However, in the case of the hearing
aid, advanced audio processing has to be an integral
part of the system anyway. All sound sources will
have to be identified and localized, and there has to
be a possibility to enhance each sound source sepa-
rately or feed it to a speech recognition system.
3.1 Estimating the attentional state
The model for the user attention consists of a num-
ber of rules. By assigning a probability to each target
the algorithm creates an estimate of the user’s atten-
tional state. Since the rules are interchangeable, dif-
ferent approaches may be evaluated. This is necessary
since psychological experiments suggest a wealth of
approaches, but it is often unclear how they will be-
have in real-life systems.
There are two basic approaches to determine the
user’s attentional state. One is by predicting the at-
tention based on the user’s current perceptions. The
other is to monitor the user’s behavior in order to find
out where the attention is directed.
Figure 2 shows a coarse overview over the mech-
anisms of the algorithm. We assume that the user
receives perceptions or stimuli from the world and,
depending on his current attentional state, reacts to
those stimuli. The stimuli are recorded by sensors and
transformed into target descriptions in a preprocess-
ing stage. Based on the target descriptions and the
user model the user’s most likely attentional state is
estimated.
Simultaneously, the user’s reactions are monitored.
Through the reactions, the system may observe the
user’s attentional state. Any differences between the
estimated and observed state are fed back into the
model.
ICINCO 2005 - ROBOTICS AND AUTOMATION
148