Authors:
Silvana Cunha Costa
1
;
Benedito G. Aguiar Neto
2
;
Joseana Macêdo Fechine
3
and
Menaka Muppa
4
Affiliations:
1
Federal Center of Techological Education of Paraíba-CEFET-PB, Federal University of Campina Grande-UFCG, Brazil
;
2
Federal University of Campina Grande-UFCG, Intitute of Technology of Washington, University of Washington, United States
;
3
Federal University of Campina Grande-UFCG, Brazil
;
4
Intitute of Technology of Washington -University of Washington Tacoma, United States
Keyword(s):
Acoustic voice analysis, speech processing, acoustic features, cepstral parameters, disordered voices, speech pathology.
Related
Ontology
Subjects/Areas/Topics:
Acoustic Signal Processing
;
Applications and Services
;
Artificial Intelligence
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Computer Vision, Visualization and Computer Graphics
;
Data Manipulation
;
Health Engineering and Technology Applications
;
Human-Computer Interaction
;
Medical Image Detection, Acquisition, Analysis and Processing
;
Methodologies and Methods
;
Neurocomputing
;
Neurotechnology, Electronics and Informatics
;
Pattern Recognition
;
Physiological Computing Systems
;
Sensor Networks
;
Soft Computing
Abstract:
Digital signal processing techniques have been used to perform an acoustic analysis for vocal quality assessment due to the simplicity and the non-invasive nature of the measurement procedures. Their employment is of special interest, as they can provide an objective diagnosis of pathological voices, and may be used as complementary tool in laryngoscope exams. The acoustic modeling of pathological voices is very important to discriminate normal and pathological voices. The degree of reliability and effectiveness of the discriminating process depends on the appropriate acoustic feature extraction. This paper aims at specifying and evaluating the acoustic features for vocal fold edema through a parametric modeling approach based on the resonant structure of the human speech production mechanism, and a nonparametric approach related to human auditory perception system. For this purpose, LPC and LPC-based cepstral coefficients, and mel-frequency cepstral coefficients are used. A vector-q
uantizing-trained distance classifier is used in the discrimination process.
(More)