Automatic Fongbe Phoneme Recognition From Spoken Speech Signal

Fréjus A. A. Laleye; Eugène C. Ezin; Cina Motamed

doi:10.5220/0006004101020109

Automatic Fongbe Phoneme Recognition From Spoken Speech Signal

Fréjus A. A. Laleye, Eugène C. Ezin, Cina Motamed

2016

Abstract

This paper reports our efforts toward an automatic phoneme recognition for an under-resourced language, Fongbe. We propose a complete recipe of algorithms from speech segmentation to phoneme recognition in a continuous speech signal. We investigated a strictly fuzzy approach for simultaneous speech segmentation and phoneme classification. The implemented automatic phoneme recognition system integrates an acoustic analysis based on calculation of the formants for vowel phonemes and calculation of pitch and intensity of consonant phonemes. Vowel and consonant phonemes are obtained at classification. Experiments were performed on Fongbe language (an African tonal language spoken especially in Benin, Togo and Nigeria) and results of phoneme error rate are reported.

References

Anapathy, S., Thomas, S., and Hermansky, H. (2009). Modulation frequency features for phoneme recognition in noisy speech. J. Acoust. Soc. Am, 125(1):EL8-EL1.
Baghdasaryan, A. G. and Beex, A. A. (2011). Automatic phoneme recognition with segmental hidden markov models. In Signals, Systems and Computers (ASILOMAR), 2011 Conference Record of the Forty Fifth Asilomar Conference on, pages 569-574.
chwarz, P., Matejka, P., and Cernocky, J. (2006). Hierarchical structures of neural networks for phoneme recognition. In 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.
Huang, X., Acero, A., and Hon, H.-W. (2001). Spoken language processing, a guide to theory, algorithm and system development. Prentice Hall.
Laleye, F. A. A., Ezin, E. C., and Motamed, C. (2015a). Adaptive decision-level fusion for fongbe phoneme classification using fuzzy logic and deep belief networks. In Proceedings of the 12th International Conference on Informatics in Control, Automation and Robotics, Volume 1, Colmar, Alsace, France, 21-23 July, pages 15-24.
Laleye, F. A. A., Ezin, E. C., and Motamed, C. (2015b). An algorithm based on fuzzy logic for text-independent fongbe speech segmentation. In 11th International Conference on Signal-Image Technology & InternetBased Systems, SITIS 2015, Bangkok, Thailand, November 23-27, pages 1-6.
Lefebvre, C. and Brousseau., A. (2001). A grammar of fonge, de gruyter mouton. page 608.
marani, S., Raviram, P., and Wahidabanu, R. (2009). Implementation of hmm and radial basis function for speech recognition. In Int. Conf. on Intelligent Agent and Multi-Agent Systems, 2009 (IAMA 2009), Chennai, pages 1-4.
Palaz, D., Collobert, R., and Magimai.-Doss, M. (2013). End-to-end phoneme sequence recognition using convolutional neural networks. Idiap-RR.
Solera-Urena, R., Martin-Iglesias, D., Gallardo-Antolin, A., Pelaez-Moreno, C., and Diaz-de Maria, F. (2007). Robust asr using support vector machines. Speech Communication, 49(4):253-267.
Trentin, E. and Gori, M. (2007). A survey of hybrid ann/hmm models for automatic speech recognition. Neurocomputing, 37(1):91-126.
Young, S. (2008). Hmms and related speech recognition technologies. Springer Handbook of Speech Processing, Springer-Verlag Berlin Heidelberg, pages 539- 557.
Yousafzai, J., Cvetkovic, Z., and Sollich, P. (2009). Tuning support vector machines for robust phoneme classification with acoustic waveforms. In 10th Annual conference of the International Speech communication association, pages 2359 - 2362, England. ISCAINST SPEECH COMMUNICATION ASSOC.

Download

Paper Citation

in Harvard Style

Laleye F., Ezin E. and Motamed C. (2016). Automatic Fongbe Phoneme Recognition From Spoken Speech Signal . In Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO, ISBN 978-989-758-198-4, pages 102-109. DOI: 10.5220/0006004101020109

in Bibtex Style

@conference{icinco16,
author={Fréjus A. A. Laleye and Eugène C. Ezin and Cina Motamed},
title={Automatic Fongbe Phoneme Recognition From Spoken Speech Signal},
booktitle={Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,},
year={2016},
pages={102-109},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006004101020109},
isbn={978-989-758-198-4},
}

in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Informatics in Control, Automation and Robotics - Volume 1: ICINCO,
TI - Automatic Fongbe Phoneme Recognition From Spoken Speech Signal
SN - 978-989-758-198-4
AU - Laleye F.
AU - Ezin E.
AU - Motamed C.
PY - 2016
SP - 102
EP - 109
DO - 10.5220/0006004101020109