(not hidden), since they produce observable events.
These observable events may stem from all imagin-
able modes of human communication. They involve
verbal, as well as non-verbal modalities like gestures
or facial expressions.
The task is to find a model that characterizes the hu-
man user, based on his/her intentions, while consider-
ing the actions a user can perform as a consequence
of these intentions. This is called a forward model,
since it covers only causal dependencies, namely the
dependency of actions upon intentions.
3 RECOGNIZING INTENTIONS
FROM ACTIONS
We address the recognition of user intentions as an al-
gorithmic reasoning process that infers hidden inten-
tions from observed actions. Since observations made
by a robotic system, as well as the correlation between
intentions and actions suffer from uncertainties, we
propose the application of a probabilistic model.
Hidden Markov Models (HMM) are well known
stochastic models for collecting information se-
quences over time in order to make estimates on hid-
den states (Rabiner, 1989). Unfortunately, they pro-
vide only a relatively simplistic way for describing
causal structures. More sophisticated models are pro-
vided by autoregressive HMMs or factorial HMMs.
The first kind treats the dependency between suc-
cessive measurements, whereas the second kind is
concerened with multiple sequences of hidden states
jointly causing a common measurement.
All these models (HMM, AR-HMM, and factor-
ial HMM) can be viewed as members of the Dy-
namic Bayesian Networks family (DBN) (Roweis and
Ghahramani, 1999). DBNs of arbitrary structure pro-
vide a higher flexibility in modeling than generic
HMMs, since they exploit the causal dependency
structure of the given domain.
In literature DBNs are often limited to discrete-
valued domains (Korb and Nicholson, 2003) and hy-
brid Networks are only considered for special cases
(Murphy, 2002). It is obvious, that the domain of
human-robot interaction can only be described by a
joint set of continuous and discrete variables. Sen-
sor measurements and the corresponding probabilis-
tic models for example rely heavily on physical laws
that are based on continuous scales like meters, de-
grees, and so on. Higher level or semantic aspects of
human behavior are often expressed by discrete vari-
ables. Hence, hybrid DBNs are very important for
intention recognition in human-robot cooperation.
In this paper we present a new approach for
user intention recognition based on Hybrid Dynamic
Bayesian Networks. The proposed approach uses
Gaussian mixture densities, i.e. sums of weighted
Gaussian densities, to describe continuous uncertain-
ties. Discrete uncertainties are described by sums of
weighted Dirac pulses.
4 BAYESIAN NETWORKS
Bayesian Networks are considered to be an effi-
cient representation of joint probabilities, exploit-
ing causal dependencies in a domain (Pearl, 1988).
This is achieved by representing the causal depen-
dency structure of a domain by means of a directed
acyclic graph (DAG). Each variable in such a domain
is depicted by a node in this graph and every edge
stands for a direct dependency between two variables.
Hence, this graph is often referred to as dependency
graph. The dependency between two variables x and
y denoted by an edge from node x to node y is mod-
elled by a conditional probability function f (y|x).
Since the direction of the edge represents the causal
dependency of y upon x, we call this a probabilistic
forward model. To describe the joint probability of all
variables in the system, not all possible combinations
of variables and their states have to be addressed. It
is sufficient to consider the conditional probability for
each variable given its parents in the graph.
The first Bayesian network models were limited to
discrete valued domains and their likelihood functions
were given by conditional tables. The most common
approach for evaluating discrete networks by means
of message passing (Pearl, 1988). In this approach
observations or measurements are incorporated into
the according nodes. These nodes send message prob-
abilities to their adjacent nodes, depending on the
modeled conditional probabilities. In this way the in-
formation travels through the network.
This approach was extended to continuous net-
works (Driver and Morrell, 1995), where Gaussian
mixtures were used to approximate the conditional
density functions and to represent the messages trav-
eling through the network.
Hybrid Bayesian networks today consider often
only linear dependencies by using so called cg-
potentials (Lauritzen, 1992). Nonlinear dependencies
cannot be covered in this type of model The treatment
of nonlinear dependencies between variables requires
more complex density representations than offered by
cg-potentials. Approximating the conditional density
functions by means of Gaussian mixtures is a well
known approach (Driver and Morrell, 1995). We ex-
tended this approach tow hybrid domains (Schrempf
and Hanebeck, 2005). Since this is the approach we
propose for intention recognition, we give a short in-
troduction in the next subsection.
ICINCO 2005 - ROBOTICS AND AUTOMATION
252