NOISE REDUCTION BASED ON CROSS TF ε-FILTER
Tomomi Abe
Major in Pure and Applied Physics, Waseda university 55N-4F-10A, 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
Mitsuharu Matsumoto, Shuji Hashimoto
Department of Applied Physics, Waseda university 55N-4F-10A, 3-4-1 Okubo, Shinjuku-ku, Tokyo, 169-8555, Japan
Keywords:
Noise reduction, speech enhancement, ε-filter, time-frequency domain.
Abstract:
A time-frequency ε-filter (TF ε-filter) is an advanced ε-filter applied to complex spectra along the time axis. It
can reduce most kinds of noise while preserving a signal that varies frequently such as a speech signal. The
filter design is simple and it can effectively reduce noise. It is applicable not only to small amplitude stationary
noise but also to large amplitude nonstationary noise. However when we consider the noise that varies much
frequently along the time axis, TF ε-filter cannot reduce noise without the signal distortion. When we consider
the noise where the neighboring frequency bins have similar powers such as impulse noise, we can reduce the
noise by using ε-filter applied to the complex spectra not along the time axis, but along the frequency axis.
This paper introduces an advanced method for noise reduction that applies ε-filter to complex spectra not only
along the time axis but also along the frequency axis labeled cross TF ε-filter. We conducted the experiments
utilizing the sounds with stationary, nonstationary and natural noise.
1 INTRODUCTION
Noise reduction plays an important role in speech
recognition and individual identification. When
we consider the instruments like hearing-aids and
phones, noise reduction for a monaural sound is
strongly expected. It will also be easy to miniatur-
ize the system size because it requires only one sig-
nal. The spectral subtraction (SS) is a well-known ap-
proach for reducing the noise signal of the monaural-
sound (Boll, 1979). It can reduce the noise effectively
despite of the simple procedure. However, it can han-
dle only the stationary noise. It also needs to estimate
the noise in advance. Although noise reduction uti-
lizing Kalman filter has also been reported (Kalman,
1960; Fujimoto and Ariki, 2002), the calculation cost
is large. Some authors have reported a model based
approach for noise reduction (Daniel et al., 2006).
In this approach, we can extract the objective sound
by learning the sound model in advance. However,
it is not applicable to the signals with the unknown
noise as well as SS. There are some approaches uti-
lizing comb filter (Lim et al., 1978). In this approach,
we firstly estimate the pitch of the speech signal, and
reduce the noise signal utilizing comb filter. How-
ever, the estimation error results in the degradation
of the speech quality. Some authors have reported
the method utilizing ε-filter (Harashima et al., 1982;
Arakawa et al., 2002). ε-filter is a nonlinear filter,
which can reduce the noise signal with preserving the
signal. ε-filter is simple and has some desirable fea-
tures for noise reduction. It does not need to have the
model not only of the signal but also of the noise in
advance. It is easy to be designed and the calculation
cost is small. It can reduce not only the stationary
noise but also the nonstationary noise. However, it
can reduce only the small amplitude noise in princi-
ple. To solve the problems, the method labeled TF
ε-filter was proposed (Abe et al., 2007). TF ε-filter
is an improved ε-filter applied to the complex spec-
tra along the time axis in time-frequency domain. By
utilizing TF ε-filter, we can reduce not only small am-
plitude stationary noise but also large amplitude non-
stationary noise. However TF ε-filter cannot reduce
the noise without distortion when the noise changes
frequently along the time axis such as impulse noise.
To solve the problem, we apply ε-filter to complex
spectra not only along the time axis but also along the
105
Abe T., Matsumoto M. and Hashimoto S. (2008).
NOISE REDUCTION BASED ON CROSS TF -FILTER.
In Proceedings of the International Conference on Signal Processing and Multimedia Applications, pages 105-112
DOI: 10.5220/0001935001050112
Copyright
c
SciTePress