depicted in Figure 4. Attached to the hypotheses
structures are automatically generated interpretation
rules, realized by the JESS rule system. The rules fire
if evidence for any concept arrives. If the evidence
fits several concepts, it is assigned to each alternative,
and independent interpretation threads are created for
the alternatives.
In our case, the evidence describing the predic-
tion situation immediately causes the creation of six
alternative threads representing possible courses of
events, two for each of the three kinds of ServeACof-
feeScene. For each kind, one of the two threads speci-
fies area instantiations for a service from the north, the
other for a service from the south. Since both compo-
nents of PlaceObjectMug are introduced as evidence,
the aggregate PlaceObjectMug is instantiated imme-
diately, as a necessary robot activity to achieve goal
mugOnPA postulated as evidence.
SCENIOR now performs prediction by “thinking
ahead”, realized by advancing a simulated time. At
the beginning of the prediction phase, the temporal
constraint nets in all threads of SCENIOR indicate
that the robot should start moving (MoveBase) to the
designated premanipulation area as a possible way to
complete evidence for higher-level aggregates (and
thus possibly achieve the goal). Hence MoveBase
is hallucinated for each thread, i.e. instantiated in
prediction mode without evidence. After a while (of
simulated time), the robot reaches the designated pre-
manipulation area, and the occurrence RobotAtPMA
is hallucinated. In the threads where blocking is ex-
pected, this leads to a completed ServeACoffeeShort-
NotBlockedActivity since the PutMugToPA has been
instantiated earlier.
For the other kinds of ServeACoffeeShortScene
the hypotheses graphs imply that the manipulation
area will be blocked and this can be observed by
the robot. The occurrences MABlockedByPerson or
MABlockedByTable are therefore hallucinated while
the robot is approaching the premanipulation area. In
the case of a person blocking the area, the robot has
learnt to wait until the area will be freed, and then to
continue serving the placement area from the antici-
pated manipulation area. In the case of a static obsta-
cle, like a table blocking the manipulation area, the
robot has learnt to turn around and move to the other
side of the table, serving the guest from the left as an
exception. These activities are hallucinated in their
respective order as the simulated time advances, and
finally the goal is achieved. The alternative threads
allow to predict completion times based on the tem-
poral model. As it turns out, they differ considerably
for our slow robot waiter depending on the blocking
situation.
Table 1: Expected minimal durations for serving a coffee.
Course of Activities Start Finish Duration
(ServeACoffeShortScene) (MugOnPA)
NotBlockedAct. 14:48:28 14:49:13 00:00:44
BlockedDynamicAct. 14:48:28 14:49:43 00:01:15
BlockedStaticAct. 14:48:28 14:54:44 00:06:12
Note that SCENIOR typically entertains a large
number of threads during a prediction process, often
more than one hundred. The threads represent alterna-
tive partial predictions due to ambiguous assignments
(several PMAs and MAs are possible) and also due to
the strategy, adopted for real-life scene interpretation,
to doubt all evidence. In our prediction experiments,
the threads are rated by a measure of completeness,
hence incomplete predictions are discarded at the end.
5 EXPERIMENTS AND
EVALUATION
In this section, we describe experiments carried out
with concrete predictions, and a first evaluation of
the approach. The first prediction experiment is
based on the ontological structures illustrated in Fig-
ure 4. SCENIOR has received background knowledge
about area attachments (areaAttachedSAPA1, etc.),
evidence about the current situation (guestAtSA1,
robotAtCounter1, holdingMug1) and postulated evi-
dence about the goal mugOnPA1.
Screenshots of alternative predictions determined
by SCENIOR for this evidence are shown in Figures
5 and 6 for Course 2 and Course 3 of Scenario C,
respectively, as described in Sections 3 and 4. The
screenshot for Course 1 cannot be shown for lack
of space. Downward arrows indicated the compo-
sitional structure of aggregates, upward arrows indi-
cate instantiations. Evidence is depicted by white
boxes (at the bottom), concepts instantiated through
evidence by dark gray boxes (in the middle), and hal-
lucinated instantiations by light gray boxes (in the top
area). Each box also shows the ranges for the starting
and finish time. For hallucinated instantiations, most
ranges remain uncertain to some extent, according to
the possible time intervals specified by the TCN.
For a real-life application, the expected minimal
durations for serving a coffee shown in Table 1 would
probably be the most interesting prediction data. As
to be expected, the obstacle-free service takes the
shortest time. Waiting for a person to move out of
the way causes a slight delay. Turning around and
travelling to the other side of the table when facing
a static obstacle causes a major delay. In our experi-
ment, the quantitative values result from durations de-
ARobotWaiterthatPredictsEventsbyHigh-levelSceneInterpretation
473