Speech Source Tracking Based on Particle Filter under non-Gaussian
Noise and Reverberant Environments
Ruifang Wang
1, a
, Xiaoyu Lan
1, b
1
School of Electronic and Information Engineering, Shenyang Aerospace University, Shenyang 110136, China
Keywords: Speech Source Tracking, non-Gaussian Noise, Particle Filter, Generalized Correntropy Function.
Abstract: Tracking a moving speech source in non-Gaussian noise environments is a challenging problem. A speech
source tracking method based on the particle filter (PF) and the generalized correntropy function (GCTF) in
non-Gaussian noise and reverberant environments is proposed in the paper. Multiple TDOAs are estimated
by the GCTF and the multiple-hypothesis likelihood is calculated as weights for the PF. Next, predict the
particles from the Langevin model for the PF. Finally, the global position of moving speech source is
estimated in term of representation of weighted particles. Simulation results demonstrate the vadility of the
proposed method.
1 INTRODUCTION
Tracking a speech source accurately in reverberant
environments is desirable for teleconferencing
system (B. Kapralos, M. R. M. Jenkin and M.
Evangelos, 2003), robots (K. Nakadai, et al, 2006),
and human-machine interaction (T.P. Spexard, M.
Hanheide, and G. Sagerer, 2007). Acquiring the
position of the speech source plays an important role
in speech signal processing region. The
environmental noise and reverberation of the speech
signal are two challenging problems for speech
source tracking. In conventional speech source
localization and tracking approaches (E. T. Roig, F.
Jacobsen and E. F. Grande, 2010), (M. F. Fallon,
and S. J. Godsill, 2012), they only depend on the
current observations to estimate the positions of the
speech source. To improve tracking performance,
Bayesian filtering algorithms are used to track the
moving speech source, which employs not only
current observations but also previous observations.
The particle filter (PF) is an approximation of the
optimal sequential Bayesian estimation via Monte
Carlo simulations for non-linear and non-Gaussian
system. The PF incorporated multiple-hypothesis
model was applied to the speaker tracking problem
based upon TDOA observations (abbreviated to PF)
(D. B. Ward, E. A. Lehmann and R. C. Williamson,
2003). A novel framework of PF based on
information theory was discussed for speaker
tracking (F. Talantzis, 2010). A non-concurrent
multiple talkers tracking based on extended Kalman
particle filtering (EKPF) was proposed (X. Zhong,
and J. R. Hopgood, 2014). In (X. Zhong, A.
Mohammadi, et al, 2013), a distributed particle filter
(DPF) was proposed in speaker tracking in a
distributed microphone network, in which each node
runs a local PF for local posteriors fused to obtain a
global posterior probability (abbreviated to DPF-
EKF). In (Q. Zhang, Z. Chen, and F. Yin, 2016), a
distributed marginalized auxiliary particle filter was
proposed for speaker tracking.
For above-mentioned speech source tracking
methods, the background noise is assumed to be
Gaussian noise. However, the practical background
noise may be non-Gaussian noise such as knock on
the door, sudden phone ringing and a fit of couching,
which is impulsive in essence and would lead to
poor tracking performance for these speech source
methods. To remedy impacts of non-Gaussian
background noise on tracking performance, a PF
based speaker tracking method under non-Gaussian
noise environments is proposed. First, the symmetric
alpha-stable (SαS) distributions (M. Shao and C. L.
Nikias, 1993) are employed to model the non-
Gaussian noise and TDOA observations of speech
signals received between a microphone pair at each
node are approximated via a generalized correntropy
function (GCTF) (W. Liu, P.P. Pokharel, et al,
2007). Next, the Langevin model (D. B. Ward, E. A.
Lehmann and R. C. Williamson, 2003) is used to
Wang, R. and Lan, X.
Speech Source Tracking based on Particle Filter under non-Gaussian Noise and Reverberant Environments.
DOI: 10.5220/0008874204610466
In Proceedings of 5th International Conference on Vehicle, Mechanical and Electrical Engineering (ICVMEE 2019), pages 461-466
ISBN: 978-989-758-412-1
Copyright
c
2020 by SCITEPRESS – Science and Technology Publications, Lda. All rights reserved
461