Keywords: Markov decision process, Anti-jamming, Impulse jamming.
Abstract: This paper proposed a method through a time domain Markov decision process as a countermeasure of
random periodic impulse jamming for a user in a time slotted environment. First, the random periodic
impulse jamming is modelled. Then the time domain MDP-based anti-jamming communication model is
proposed and the optimal transition probability on each state is calculated. Finally, we proposed an online
learning algorithm to approach the optimal transition probabilities. Simulation results show that our method
is better than other countermeasures of impulse jamming.
1 INTRODUCTION
Impulse jamming can corrupt the data transmission
of communication system(Poisel, 2011) in various
applications like IoT systems(Landa et al., 2017),
OFDM systems(Epple and Schnell, 2017) et al. A
short form periodic jamming (SFPJ) attack can cause
huge reduction of packet delivery ratio (PDR) with
little cost and traditional anti-jamming schemes such
as spread spectrum techniques in frequency domain
is not appropriate to the situation due to the impulse
signal has a wide spectrum density(Debruhl and
Tague, 2013). One usual pulse jamming pattern is
called periodic impulse jamming, which generates
impulse jamming periodically. The jamming source
is widely distributed in practice, such as high-
voltage equipment(Lin et al., 2015). Despite the
interferences generated by nature, impulse jamming
is also commonly used by malicious users to corrupt
communication links. Jie et al. derived a closed form
of BER (Bit Error Rate) of optimal periodic impulse
jamming for QPSK system(Jie et al., 2017). As a
countermeasure, the detection of periodic impulse
jamming is studied. Yuan Yuan He, et al.(He et al.,
2008) used wavelet transforming method to estimate
impulse jamming.
Instead of periodic impulse jamming, malicious
user can use variants of periodic impulse jammings
to improve jamming effect and to avoid being
detected. Random periodic impulse jamming is a
kind of impulse jamming whose occurrence time
obeys some distribution. We proposed a Markov
decision process (MDP) based countermeasures to
mitigate the jam effects.
This paper is organized as follow. The system
model is introduced in Section 2. In Section 3, we
calculated the optimal transmission probability
under certain jamming probability. An online
learning algorithm is provided in section 4 to obtain
the optimal transition probability vector. Section 5
presents computer simulation results. Finally, in
Section 6, some concluding remarks are provided.
2 SYSTEM MODEL
Consider the situation where a synchronized time-
slotted communication system consists 2 licensed
users, one of which sender, the other receiver. In
each time slot, the sender sends a frame with length