
transition is defined by a set of preconditions and
postconditions. It is said to be applicable from a
state if its preconditions are satisfied by the state.
The planner recursively identifies all reachable
states from each state and plans actions when
possible in order to guide the world towards the goal
state. An applicable transition from a state may lead
to a failure state, e.g., a robot falling off a cliff. To
ensure system safety, the planner guarantees that no
failure transition fires from a state.
ORICA uses the world state space model as
presented below, to define the real-time behaviour of
the world.
2.1 World State Space Representation
World state space is a set of possible world states.
Each state has transitions leading to other states.
If
where S is finite set of world states,
and where T is the set of all possible
world transitions then:
Sss ∈
21
,
21
, ss ≠ Tt ∈
t :s
1
→s
2
and ,
1 2
)( st =∇ )( st =
where
and ℜ are the domain and range functions
on a transition. ORICA segregates the transitions
into five categories, as given below:
∇
arttree
tttttT UUUU=
Each transition t has two time intervals min∆(t)
and max∆(t), measured from the instant the world
enters
(t). The former represents the minimum
duration after which t can fire, and the latter the
maximum duration before which t must fire. Their
respective min∆ and max∆ are given as:
∇
where 0 < firing time < ∞, bect(t
a
) and wcet(t
a
)
represent the best-case and the worst-case execution
times of the action transition. The event, temporal
and guaranteed temporal transitions, represent the
environmental events. An event transition may fire
at any instant as the world enters the domain state.
However, it is not guaranteed to fire. A temporal
transition is guaranteed not to fire before a finite
time delay, while a guaranteed temporal transition is
guaranteed to fire between min∆ and max∆ delays.
The guaranteed event and action transitions
represent the agent’s responses. The guaranteed
event guarantees that the system will react at
predefined deadline after the domain state of the
transition is reached. The action guarantees that the
system will respond before a finite time deadline.
Two types of sub-regions are defined in the
world state space, i.e., the safe region and the threat
region. The “safe region” is a set of “safe states”
from which direct failure is impossible. The threat
region consists of the failure states and “threat
states”, states from which failure is possible.
The AI planner may build a plan leading to a
state inside a threat region, in order to achieve the
goal. However, it must ensure that a guaranteed path
will take the world to safe region by pre-empting the
failure transitions in the threat region. This is done
by planning a guaranteed transition, i.e., an action, a
guaranteed event or a guaranteed temporal
transition, which will fire before failure can occur,
i.e.,
max∆(gt(s)) < min∆(ttf (s))
where gt(s) and ttf(s) are respectively guaranteed
and temporal transition to failure from state s.
A guaranteed event is like a watch-dog timer set
when the world reaches its domain state. It enables
the system to modify its beliefs about the world state
when no external event occurs by a finite time.
2.3 Dependent Temporal Transitions
to Failure
All transitions inside a threat region, taking the
world from a threat state to another, consume a finite
amount of time (except for an event transition which
may be instantaneous). Inside the threat region the
world moves towards a failure.
The time to failure from a threat state is
represented as a dependent temporal transition to
failure (dttf) and ORICA states that it depends on the
length of previous transitions in the threat region and
their effect on the cause leading to failure (Omar,
04), e.g., throwing water on fire may reduce its
spread but throwing oil will speed it up.
Transition Symbol min∆ max∆
Event t
e
0 ∞
Guaranteed event t
re
firing time firing time
Temporal t
t
> 0 ∞
Guaranteed
temporal
t
rt
> 0 < ∞
Action t
a
bcet(ta) Sensing
delay+wcet(ta)
If the previous state s
i-1
is more threatening than
the currents state s
i
then the time to failure from the
current state is given as:
))((min
i
sdttf
))((max
))((min
))((min
1
1_
−
∆−
⎥
⎥
⎦
⎤
⎢
⎢
⎣
⎡
∆
×∆
ia
i
i
st
sttf
X
sttf
Otherwise it is given as:
⎥
⎥
⎦
⎤
⎢
⎢
⎣
⎡
∆
∆−
×∆=∆
−
))((min
))((max
))((min))((min
1_
1
i
ia
ii
sttf
stX
sttfsdttf
BEST-ACTION PLANNING FOR REAL-TIME RESPONSE - An approach in ORICA
307