5 SIMULATION RESULTS
The proposed on-line adaptive control scheme was
verified and compared to the non-adaptive scheme
proposed in (Corradini et al., 2003) by simulations.
The mobile robot was simulated through a discrete-
time model whose error dynamics were given in (7).
The covariance R of the noise vector ε
k
was set to
1 × 10
−6
I
3
, where I
i
denotes an (i × i) identity ma-
trix. Neural network pre-training is not used and, to
demonstrate further the adaptive feature of the pro-
posed controller, the model used for simulations is
abruptly modified by replacing T by 1.5T in (7) in the
middle of the simulation, specifically at 12.5 seconds.
This has the effect of modifying the robot model
considerably without altering its kinematic structure.
The selected reference trajectory was generated recur-
sively (one time step ahead) using:
x
r
(t) = 2 cos(0.25t),
y
r
(t) = 2 sin(0.5t), (37)
θ
r
(t) = arctan ( ˙y
r
/ ˙x
r
) ,
which define a figure-of-eight path in the x-y plane.
Neural networks NN
f
and NN
G
were structured
with 18 and 1 hidden unit neurons respectively. The
EKF initial covariance P
0
was set to 5I
α
, where α is
the total number of weights. The initial weight val-
ues were randomly generated from a normal distrib-
ution with zero mean. The controller design parame-
ters were set according to (36) with the eigenvalues
placed inside the unity circle. Several simulation tri-
als were conducted, each indicating good tracking and
repeatable performance for the proposed algorithm.
A number of results from a particular 25 second du-
ration simulation, with a time step of 10 ms, are de-
picted in Figure 1.
Referring to Figure 1, plots (a) to (g) were gen-
erated using the proposed adaptive controller while
plot (h) corresponds to the non-adaptive controller
proposed in (Corradini et al., 2003) under the same
specified simulation conditions and assuming that the
original robot functions are known. Primarily one
should note that the pose errors shown in plots (a),
(b) and (c) remain well bounded about zero during the
whole trajectory, indicating that the proposed scheme
has learned the nonlinear functions and yields stable
control throughout the simulation, including the pe-
riod following the previously mentioned model vari-
ation at time 12.5 seconds. This variation is over-
come with no more than a slight error transient vis-
ible in (b). This transient dies out quickly once the
controller adapts to the recently modified model. By
contrast, plot (h) reveals that the non-adaptive method
proposed in (Corradini et al., 2003) goes unstable just
after this model variation.
The complete trajectory path in the x − y plane is
superimposed on the reference trajectory in plot (e).
The two are hardly distinguishable due to the good
tracking performance obtained. The relatively high
deviation in the initial part of the trajectory, magnified
in plot (f), corresponds to the learning phase of the
neural networks, which require no preliminary off-
line training.
The neural network weights remain well bounded
at reasonable values. Plot (d) depicts time plots for
three particular weights, selected arbitrarily. For these
particular simulations, the absolute maximum and
minimum weight values over the two trajectories were
8.2 and −9.4 respectively, with the majority of the
weights centred about zero. This indicates that the
proposed algorithm is also practically realizable as no
infinitely high neural network weights are required.
As a result, the control velocities v and ω remained
well bounded with decent magnitudes.
Plot (g) depicts the mean of the diagonal of the co-
variance matrix P
k
in time. This is used to indicate
the uncertainty in the estimated weights, as generated
recursively by the EKF. As expected, the uncertainty
decreases with time, indicating that the EKF is stable
and the neural network estimations are continuously
improving. The slope of this curve is related to the
learning rate of the system.
6 CONCLUSIONS
In this paper a neural network adaptive control
scheme for the trajectory tracking problem of mo-
bile robots is proposed. The resulting algorithm re-
quires no preliminary information about the process
non-linear functions and uses MLP neural networks,
trained online in consideration of the process uncer-
tainties and external disturbances by using the EKF.
The designed scheme was tested repeatedly by simu-
lation for several noise conditions and sudden model
variations, modeling the uncertainty and time-varying
parameters encountered in practical environments. In
contrast to the non-adaptive scheme proposed in (Cor-
radini et al., 2003), the robot exhibited good tracking
performance in each case.
Future research will include the development of
stochastic non-linear control laws (Fabri and Kadirka-
manathan, 2001), which would take into account
the neural network approximation errors recursively
through the readily available covariance matrix P
k
.
Such a controller would then be amalgamated with the
estimation algorithm developed in this paper, replac-
ing the current heuristic certainty equivalence control
law. This is anticipated to give better transient perfor-
mance due to its stochastic features. Stability proofs
in the ambience of stochastic control are very rare due
to the complexity of random processes, but is still part
of our agenda for future work.
MULTILAYER PERCEPTRON FUNCTIONAL ADAPTIVE CONTROL FOR TRAJECTORY TRACKING OF
WHEELED MOBILE ROBOTS
71