5 CONCLUSIONS
From the test with synthesized voices, it can be
concluded that the proposed algorithm presents a fair
level of accuracy, in particular for female voices
which is very important feature in a diagnosis scene.
The algorithm doesn’t have significant tendency to
underestimated or overestimate. This feature means
that there is no need to calibrate or compensate the
estimated HNR value. In some cases, calibration
may not be effective due to the acoustic diversity of
human voices. In spite of presenting the second best
values regarding the variation of the errors according
to F0 and theoretical HNR variation, it can be
concluded that the measurement performed by the
proposed algorithm is not substantially affected by
the F0 and the noise level. This means that the
proposed algorithm measures the female and male
voices and several hoarseness levels with
approximately the same accuracy.
From the tests with real voices, the proposed
algorithm showed coherent HNR values,
approximately similar to the HNR values of other
algorithms. The harmonic plus noise model assumes
good approximation of the harmonic and noise
components which are present in pathological
voices. Moreover, the results show that there is a
correspondence between the artifacts that were
associated to harmonic component and to noise
component in each signal representation.
ACKNOWLEDGEMENTS
This work was developed in a doctoral program
supported by the Portuguese Foundation for Science
and Technology under the reference
SFRH/BD/24811/2005.
REFERENCES
Levison, S. E., 2005. Mathematical Models for Speech
Technology, John Wiley & Sons Ltd, The Atrium,
Southern Gate, Chichester,West Sussex PO19 8SQ,
England.
Yumoto, E., and Gould, W., 1982. Harmonics-to-noise
ratio as an index of the degree of hoarseness, The
Journal Acoustic Society of America. 71(6):1544-
1550.
Ferreira ,Aníbal J. S., 1998.An Odd-DFT based approach
to time-scale expansion of audio signals. IEEE
Transactions on Speech and Audio Processing,
7(4):441–453.
Ferreira ,Aníbal J. S., 2001.Accurate estimation in the
ODFT domain of the frequency, phase and magnitude
of stationary sinusoids. In IEEE Workshop on
Applications of Signal Processing to Audio and
Acoustics, pages 47–50. IEEE.
Yegnanarayana, B., d’Alessandro, C., Darsinos, V. 1998.
An iterative algorithm for decomposition of speech
signals into periodic and aperiodic components.
Speech and Audio Processing, IEEE Transactions
6(1): 1-11.
Qi, Y., and Hillman, R. E., 1997. Temporal and spectral
estimations of harmonics-to-noise ratio in human
voice signals. The Journal of the Acoustical Society of
America 102(1): 537-543.
Boersma, P., 1993.Accurate short-term analysis of the
fundamental frequency and the harmonics-to-noise
ratio of a sampled sound. In Proceedings of the
Institute of Phonetic Sciences 17, 97-110.
Murphy, P. J. and. Akande, O., 2006. Noise estimation in
voice signals using short-term cepstral analysis, The
Journal of the Acoustical Society of America. 121
(3):1679-1690.
Michaelis ,D., Gramss, T., Strube H.W., 1997. Glottal-to-
Noise Excitation Ratio a New Measure for Describing
Pathological Voices. Acustica-Acta Acustica 83,700-
706
Fant, G., Liljencrants, J. and Lin, Q., 1985. A four
parameter model of glottal flow. STL/QPSR 4/1985,
French Swidish Symposium 1,1-13
BIOSIGNALS 2009 - International Conference on Bio-inspired Systems and Signal Processing
356