0.52
0.53
0.54
0.55
0.56
0 1000 2000 3000 4000 5000 6000
z-coordinate
time step
target
low-gain PID
dynamic estimate
direct mapping
Figure 13: Swinging-the-dumb-bell task. The figure com-
pares low-gain PID control (shown only on the middle tra-
jectory) with the composite-controller that uses either the
predicted mass estimate from dynamics or a direct mapping
from (θ,
˙
θ,
¨
θ, s) onto torques. For the estimate from sen-
sors, the results were similar to the dynamics case and are
omitted in the graph to improve legibility. The true mass
increased continuously from 0 to 0.06.
variables, inferring the mass failed, and thus, the con-
trol torques were inaccurate. In principle, all iner-
tia parameters could be inferred from the dynamics,
but this inference requires modeling a 10-dimensional
latent-variable space, which is unfeasible without ex-
tensive training data.
On the other hand, the direct mapping onto
torques with sensors as additional input could predict
accurate control torques under two varying context
variables and in principle could cope with arbitrary
changes of the manipulated object (including exter-
nal forces). Further advantages of this strategy are its
simplicity (it only requires function approximation),
and for training, no labeled contexts are required.
In future work, we try to replicate these findings
on a real robot arm with real tactile sensors. Real sen-
sors might be more noisy compared to our simulated
sensors; particularly, the interface between sensor and
object is less well controlled.
ACKNOWLEDGEMENTS
The authors are grateful to the German Aerospace Center
(DLR) for providing the data of the Light-Weight Robot III
and to Marc Toussaint and Djordje Mitrovic for their con-
tribution to the robot-arm simulator. This work was funded
under the SENSOPAC project. SENSOPAC is supported
by the European Commission through the Sixth Framework
Program for Research and Development up to 6 500 000
EUR (out of a total budget of 8 195 953.50 EUR); the SEN-
SOPAC project addresses the “Information Society Tech-
nologies” thematic priority. G. P. was funded by the Greek
State Scholarships Foundation.
REFERENCES
Haruno, M., Wolpert, D. M., and Kawato, M. (2001).
Mosaic model for sensorimotor learning and control.
Neural Computation, 13:2201–2220.
Kalman, R. E. (1960). A new approach to linear filtering
and prediction problems. Transactions of the ASME -
Journal of Basic Engineering, 82:35–45.
Narendra, K. S. and Balakrishnan, J. (1997). Adaptive con-
trol using multiple models. IEEE Transactions on Au-
tomatic Control, 42(2):171–187.
Narendra, K. S. and Xiang, C. (2000). Adaptive con-
trol of discrete-time systems using multiple mod-
els. IEEE Transactions on Automatic Control,
45(9):1669–1686.
Petkos, G., Toussaint, M., and Vijayakumar, S. (2006).
Learning multiple models of non-linear dynamics for
control under varying contexts. In Proceedings of
the International Conference on Artificial Neural Net-
works. Springer.
Petkos, G. and Vijayakumar, S. (2007). Context estimation
and learning control through latent variable extraction:
From discrete to continuous contexts. In Proceedings
of the International Conference on Robotics and Au-
tomation. IEEE.
Rasmussen, C. E. and Williams, C. K. I. (2006). Gaussian
Processes for Machine Learning. MIT Press.
Sciavicco, L. and Siciliano, B. (2000). Modelling and Con-
trol of Robot Manipulators. Springer.
Vijayakumar, S., D’Souza, A., and Schaal, S. (2005). In-
cremental online learning in high dimensions. Neural
Computation, 17:2602–2634.
Wolpert, D. M. and Kawato, M. (1998). Multiple paired
forward and inverse models for motor control. Neural
Networks, 11:1317–1329.
SENSOR-ASSISTED ADAPTIVE MOTOR CONTROL UNDER CONTINUOUSLY VARYING CONTEXT
269