3.1.4 Bayesian Path Waypoint Prediction
Based on the improved probabilistic dynamic model
in Eq. 4 and the goal intention evaluation from Eq. 1,
one can estimate the next possible position
ˆ
x
x
x
i+1
∈
ˆ
X
considering a candidate goal region through
Pr(x
x
x
i+1
|X
X
X
1:i
,θ
η
) = Pr(x
x
x
i+1
|x
x
x
i
,θ
η
) ×Pr(θ
η
|X
X
X
1:i
).
(6)
Intuitively, one can further, based on Eq. 6, recur-
sively estimate the position of the robot x
x
x
i+ j
in the
coming time horizon
Pr(x
x
x
i+ j+1
|X
X
X
1:i
,θ
η
) =
∑
x
x
x
i+ j
∈
˜
X
X
X
i+ j
[Pr(x
x
x
i+ j+1
|x
x
x
i+ j
,θ
η
)
×Pr(x
x
x
i+ j
|X
X
X
1:i
,θ
η
)].
(7)
However, as mention in (Best and Fitch, 2015), the
analytical evaluation of Eq. 7 is difficult due to the
branching factor of the roadmap. Therefore, the
trajectory waypoints will be estimated through the
Monte-Carlo sampling approach.
Based on the evaluated goal intention probability
distribution in Eq. 1, N
η
trajectories will be sampled
from current position x
x
x
i
to the goal region θ
η
. In
each sampling, the next possible position node
ˆ
x
x
x
i+ j
in the region
˜
X
X
X
i+ j−1
is chosen based on the probabil-
ity given the improved probabilistic dynamic model
in Eq. 4.
Rather than greedily choosing the most sampled
nodes at each prediction time step, a sum-pooling pro-
cedure in this work is utilized to generate the final
predicted waypoints. Each visited node in
ˆ
X
X
X
i+1:i+ j
at the time step j will be pooled and converted into
a grid graph
¯
X ∈ R
c
x
×c
y
, where the parameters c
x
and c
y
are the number of grids in x-/y-coordinate of
¯
X , respectively. Then, for each goal region, the vis-
iting times of each grid on the graph will be summed
into a counting map M
M
M
c
at each prediction time step,
which indicates the grid’s frequency of being visited.
The sum-pooling process sacrifices the prediction ac-
curacy to reduce the distribution unbalance of the gen-
erated nodes on the roadmap
ˆ
X and eliminate the
prediction of too short paths, especially when a rel-
atively small α in Eq. 4 is chosen. At the end, for
each goal region, the most visited grid to the goal re-
gion at each prediction time step will be recorded and
formulated as the predicted waypoint of the observed
robot as X
X
X
pred
i+1:i+k
:= {x
x
x
pred
i+ j
∈ R
2
|j ∈ [1, ...,k ], θ
η
}.
3.2 Optimization-based Trajectory
Prediction
The proposed method in Section 3.1 can effectively
provide a rough path based on the goal intention eval-
uation and Monte-Carlo sampling approach. How-
ever, the predicted path is discontinuous and only
composed of several key waypoints. Besides, the def-
inition of the available area
˜
X
X
X
i
is based on the param-
eters r
PRM
and k
PRM
in Eq. 3. These two parameters
concern the probabilistic completeness of the gener-
ated roadmap, and the physical limitations of the ob-
served robot are neglected. Therefore, the time step
mentioned in last section cannot provide a precious
time allocation of the predicted paths. At the second
stage, instead of using a velocity model to estimate
the time allocation along the predicted line segments
in (Best and Fitch, 2015), an optimization-based tra-
jectory prediction in this work is proposed to predict a
more reasonable trajectory in the future based on the
previously indicated path waypoints.
The proposed optimization formulation will deter-
mine an optimized trajectory that fulfils several rea-
sonable constraints. First, the robot’s dynamics func-
tion and some observable physical limitations need to
be satisfied. For instance, the observed maximal ab-
solute velocity and the acceleration can be estimated
given the robot’s previous trajectory, and they are uti-
lized to bound the predicted state of the mobile robot
in the optimization problem. Besides, since the robot
has its certain motion intention, the robot is unlikely
to linger about the scenario. Therefore, the estimated
total travel time and the optimized trajectory distance
should be minimized as possible. Furthermore, the
predicted trajectory should pass through the previ-
ously estimated path waypoints in sequence. To that
end, the complementary progress constraints (CPC)
are introduced in the proposed optimization problem,
which will be detailed in the next section.
3.2.1 Complementary Progress Constraints
At this stage, an optimized trajectory with a fixed
time interval will be generated to present the most
likely trajectory of the observed omnidirectional mo-
bile robots in the future. The time interval dt is de-
fined as t
N
/n
p
, where the number of the new gen-
erated optimized trajectory nodes is marked as n
p
,
and t
N
is the total travel time of the optimized tra-
jectory. To handle the predicted path waypoints, a
progress variable set Λ
Λ
Λ := {λ
λ
λ
p
∈R
n
w
|p = [1,..., n
p
]}
is introduced to indicate whether the optimized tra-
jectory passes through the desired waypoints in a se-
quence (Foehn and Scaramuzza, 2020). Here, n
w
is
the number of path waypoints estimated from last sec-
tion, which satisfies n
p
> n
w
obviously. Due to the
sum-pooling procedure in last section, the number of
the waypoints n
w
meets n
w
≤ k , since the predicted
waypoint at different time step j may stay at the same
grid on the counting map M
M
M
c
.
Optimization-based Trajectory Prediction Enhanced with Goal Evaluation for Omnidirectional Mobile Robots
267