On the Problem of Data Availability in Automatic Voice Disorder Detection
Dayana Ribas, Antonio Miguel, Alfonso Ortega, Eduardo Lleida
2023
Abstract
Automatic voice disorder detection systems support medical doctors in providing more versatile health assistance by enabling the remote diagnosis, treatment, and monitoring of voice pathologies. The main obstacle to developing this technology is the limited availability of audio data of healthy and pathological voices manually labeled by experts. The Saarbruecken Voice Database (SVD) was created in 1997 and contains more than 5 hours of healthy and pathological audio data. It has been widely used for developing voice disorder detection systems. However, issues in its data distribution and labeling make it difficult to conduct conclusive studies. This paper evaluates an Automatic Voice Disorder Detection (AVDD) system using the recent Advanced Voice Function Assessment Database (AVFAD), which contains almost 40 hours of audio data, with SVD as a reference. The system consists of a representation based on spectral, prosody, and voice quality parameters followed by an SVM classifier, and it obtains up to 88% accuracy on phrases and 86% on the sustained vowel /a/. A data augmentation strategy based on the SMOTE method is assessed for handling data imbalance; it improves the performance of male, female, and gender-independent models without degrading the results in scenarios where the data are already balanced. Finally, we release the system implementation for voice disorder detection, including the list of train-test partitions for both databases.
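The SMOTE step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' released implementation: the function names `smote` and `balance`, the neighbour count `k`, and the toy feature vectors are assumptions made for illustration only. The idea shown is the core of SMOTE: each synthetic sample is placed at a random point on the segment between a minority-class sample and one of its nearest minority-class neighbours, and the minority class is oversampled until it matches the majority class.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic samples by interpolating between minority samples.

    Hypothetical helper for illustration; `minority` is a list of feature vectors.
    """
    if n_new <= 0:
        return []
    if len(minority) < 2:
        raise ValueError("SMOTE needs at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of the base sample,
        # excluding the base sample itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def balance(features, labels, k=5, seed=0):
    """Oversample every smaller class with SMOTE up to the largest class size."""
    by_class = {}
    for x, y in zip(features, labels):
        by_class.setdefault(y, []).append(x)
    target = max(len(rows) for rows in by_class.values())
    out_x, out_y = list(features), list(labels)
    for y, rows in by_class.items():
        extra = smote(rows, target - len(rows), k=k, seed=seed)
        out_x.extend(extra)
        out_y.extend([y] * len(extra))
    return out_x, out_y


if __name__ == "__main__":
    # Toy two-dimensional features: label 0 is the majority class,
    # label 1 the minority class (values are made up).
    feats = [[0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [0.2, 0.4], [0.9, 0.8], [1.0, 0.9]]
    labs = [0, 0, 0, 0, 1, 1]
    bal_x, bal_y = balance(feats, labs, k=1)
    print(len(bal_x), bal_y.count(0), bal_y.count(1))
```

In a setup like the one the abstract describes, the balanced features would then be passed to the SVM classifier. Oversampling is normally applied to the training partition only, so that the test partition still contains real recordings exclusively.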
Paper Citation
in Harvard Style
Ribas D., Miguel A., Ortega A. and Lleida E. (2023). On the Problem of Data Availability in Automatic Voice Disorder Detection. In Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF; ISBN 978-989-758-631-6, SciTePress, pages 330-337. DOI: 10.5220/0011669300003414
in Bibtex Style
@conference{healthinf23,
author={Dayana Ribas and Antonio Miguel and Alfonso Ortega and Eduardo Lleida},
title={On the Problem of Data Availability in Automatic Voice Disorder Detection},
booktitle={Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF},
year={2023},
pages={330-337},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011669300003414},
isbn={978-989-758-631-6},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2023) - Volume 5: HEALTHINF
TI - On the Problem of Data Availability in Automatic Voice Disorder Detection
SN - 978-989-758-631-6
AU - Ribas D.
AU - Miguel A.
AU - Ortega A.
AU - Lleida E.
PY - 2023
SP - 330
EP - 337
DO - 10.5220/0011669300003414
PB - SciTePress