3.2.3 Spectrum
The spectrum of frequencies present in the voiced
sample is used in two steps of processing. First, the
spectrum is used for the cepstral analysis, which
serves for to find the fundamental frequency precise
value. The second use of the spectrum provides
input data for the filtering using harmonic
frequencies filters.
The spectrum is calculated by Fourier transform
using its fast form (3).
1, ... ,0
1
0
2
NkexX
N
n
N
n
ki
nk
(3)
3.2.4 Cepstrum, Fundamental Frequency
The next step provides a cepstrum. The cepstra
analysis, as described above, provides cepstral
coefficients.
The real cepstrum is used to find the value of the
fundamental frequency. The value is expected in the
range from 60 to 400 Hz for the human voice
(Campbell, 1997). The peak is to be found in this
range (Figure 1) and converted from the cepstral
coefficient number to the frequency domain. The
fundamental frequency is the base for the calculating
of the harmonic frequencies to be used for the
filtering.
3.2.5 Harmonic Spectrum Vector
When the harmonic frequency filters are set using
the fundamental frequency, the spectrum is filtered
(Figure 6). Because the power at the specific
frequency depends on the volume of the input signal,
the absolute values can not be used. The power
values are related to the power at the fundamental
frequency.
0 100 200 300 400 500 600 700
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
Frequency c ontent
frequency (Hz)
f
2f
5f
Figure 6: The frequency content after filtering.
The power relations between given harmonic and
the fundamental frequency constitute the values of
harmonic frequency vector, we expect to be specific
for the given speaker. The vectors are calculated
from more voiced segments to be ready to process
by statistic methods.
The Figure 5 shows the powers of harmonic
frequencies obtained from the spectrum using
harmonic filters set by the fundamental frequency
value (2).
4 CONCLUSIONS
The proposed technique is in the testing phase. All
the computations are processed in the MATLAB
environment.
The partial results are before the deeper process
of comparision with another methods. If the testing
shows and confirm the measurable dependency of
the voice harmonic spectrum on the given speaker, it
will be usable to improve the reliability of the
speaker identification process based on the
charasteristic features of the speaker’s voice.
ACKNOWLEDGEMENTS
This work was supported by the project No.
CZ.1.07/2.2.00/28.0327 Innovation and support of
doctoral study program (INDOP), financed from EU
and Czech Republic funds.
REFERENCES
Abdulla, W. H., 2002. Auditory based feature vectors for
speech recognition systems. In: Advances in
Communications and Software Technologies.WSEAS
Press, Stevens Point, Wisconsin, USA.
Atassi, H., 2008. Metody detekce základního tónu řeči. In:
Elektrorevue, Vol.4.
Campbell, Jr, J. P., 1997. Speaker recognition: a tutorial.
In: IEEE 85.
Horák, O., 2012. The Voice Segment Type Determination
using the Autocorrelation Compared to Cepstral
Method. In: WSEAS Transactions on Signal
Processing, vol. 8, issue 1.
Horák, O., 2012. Phoneme Recognizer Based Verification
of the Voice Segment Type Determination. In:
Proceedings of the 3rd International conference on
Applied Informatics and Computing Theory (AICT
'12). WSEAS Press, Stevens Point, Wisconsin, USA.
Jung, J. S., Kim, J. K., and Bae, M. J., 2004. Speaker
Recognition System Using the Prosodic Information.
ICSOFT2013-8thInternationalJointConferenceonSoftwareTechnologies
276