A related issue concerns the clock drifts, which may
also cause the learning procedure to fail to converge
to a stable scheduling. We plan to address these
issues in our future work.
5 CONCLUSIONS
In this paper we presented a decentralized rein-
forcement learning (RL) approach for self-organizing
wake-up scheduling in wireless sensor networks
(WSNs). Our approach drives nodes to coordinate
their wake-up cycles based only on local interactions.
In doing so, agents independently learn both to syn-
chronize their active periods with some nodes, so that
message throughput is improved, and at the same time
to desynchronize with others in order to reduce com-
munication interference. We refer to this concept
as (de)synchronicity. We investigated three different
topologies and showed that agents are able to inde-
pendently adapt their duty cycles to the routing tree of
the network. For high data rates this adaptive behav-
ior improves both the throughput and lifetime of the
system, as compared to a fully synchronized approach
where all nodes wake up at the same time. We demon-
strated how initially randomized wake-up schedules
successfully converge to the state of (de)synchronicity
without any form of explicit coordination. As a result,
our approach makes it possible that agent coordina-
tion emerges rather than is agreed upon.
ACKNOWLEDGEMENTS
The authors of this paper would like to thank the
anonymous reviewers for their useful comments and
valuable suggestions. This research is funded by the
agency for Innovation by Science and Technology
(IWT), project IWT60837; and by the Research Foun-
dation - Flanders (FWO), project G.0219.09N.
REFERENCES
Cohen, R. and Kapchits, B. (2009). An optimal wake-
up scheduling algorithm for minimizing energy con-
sumption while limiting maximum delay in a mesh
sensor network. IEEE/ACM Trans. Netw., 17(2):570–
581.
Degesys, J., Rose, I., Patel, A., and Nagpal, R. (2007).
Desync: self-organizing desynchronization and tdma
on wireless sensor networks. In Proceedings of the 6th
IPSN, pages 11–20, New York, NY, USA. ACM.
http://www.omnetpp.org/ – a C++ simulation library and
framework.
Ilyas, M. and Mahgoub, I. (2005). Handbook of sensor net-
works: compact wireless and wired sensing systems.
CRC.
Knoester, D. B. and McKinley, P. K. (2009). Evolving vir-
tual fireflies. In Proceedings of the 10th ECAL, Bu-
dapest, Hungary.
Langendoen, K. (2008). Medium access control in wireless
sensor networks. Medium access control in wireless
networks, 2:535–560.
Leng, J. (2008). Reinforcement learning and convergence
analysis with applications to agent-based systems.
PhD thesis, University of South Australia.
Liang, S., Tang, Y., and Zhu, Q. (2007). Passive wake-up
scheme for wireless sensor networks. In Proceedings
of the 2nd ICICIC, page 507, Washington, DC, USA.
IEEE Computer Society.
Liu, Z. and Elhanany, I. (2006). Rl-mac: a reinforcement
learning based mac protocol for wireless sensor net-
works. Int. J. Sen. Netw., 1(3/4):117–124.
Lucarelli, D. and Wang, I.-J. (2004). Decentralized syn-
chronization protocols with nearest neighbor commu-
nication. In Proceedings of the 2nd SenSys, pages 62–
68, New York, NY, USA. ACM.
Mirollo, R. E. and Strogatz, S. H. (1990). Synchronization
of pulse-coupled biological oscillators. SIAM J. Appl.
Math., 50(6):1645–1662.
Paruchuri, V., Basavaraju, S., Durresi, A., Kannan, R., and
Iyengar, S. S. (2004). Random asynchronous wakeup
protocol for sensor networks. In Proceedings of the
1st BROADNETS, pages 710–717, Washington, DC,
USA. IEEE Computer Society.
Patel, A., Degesys, J., and Nagpal, R. (2007). Desynchro-
nization: The theory of self-organizing algorithms for
round-robin scheduling. In Proceedings of the 1st
SASO, pages 87–96, Washington, DC, USA. IEEE
Computer Society.
Schurgers, C. (2007). Wireless Sensor Networks and Appli-
cations, chapter Wakeup Strategies in Wireless Sensor
Networks, page 26. Springer.
Sutton, R. S. and Barto, A. G. (1998). Reinforcement Learn-
ing: An Introduction. MIT Press.
Watkins, C. (1989). Learning from delayed rewards. PhD
thesis, University of Cambridge, England.
Werner-Allen, G., Tewari, G., Patel, A., Welsh, M., and
Nagpal, R. (2005). Firefly-inspired sensor network
synchronicity with realistic radio effects. In Proceed-
ings of the 3rd SenSys, pages 142–153, New York, NY,
USA. ACM.
Wolpert, D. H. and Tumer, K. (2008). An introduction
to collective intelligence. Technical Report NASA-
ARC-IC-99-63, NASA Ames Research Center.
Ye, W., Heidemann, J., and Estrin, D. (2004). Medium
access control with coordinated adaptive sleeping for
wireless sensor networks. IEEE/ACM Trans. Netw.,
12(3):493–506.
Zheng, R., Hou, J. C., and Sha, L. (2003). Asynchronous
wakeup for ad hoc networks. In Proceedings of the 4th
MobiHoc, pages 35–45, New York, NY, USA. ACM.
SELF-ORGANIZING SYNCHRONICITY AND DESYNCHRONICITY USING REINFORCEMENT LEARNING
103