loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Cornelius Glackin 1 ; Julie Wall 2 ; Gérard Chollet 1 ; Nazim Dugan 1 and Nigel Cannings 1

Affiliations: 1 Intelligent Voice Ltd., United Kingdom ; 2 University of East London, United Kingdom

Keyword(s): Phoneme Recognition, Convolutional Neural Network, TIMIT.

Related Ontology Subjects/Areas/Topics: Applications ; Artificial Intelligence ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Processing ; Pattern Recognition ; Symbolic Systems

Abstract: This paper presents a novel application of convolutional neural networks to phoneme recognition. The phonetic transcription of the TIMIT speech corpus is used to label spectrogram segments for training the convolutional neural network. A window of a fixed size slides over the spectrogram of the TIMIT utterances and the resulting spectrogram patches are assigned to the appropriate phone class by parsing TIMIT’s phone transcription. The convolutional neural network is the standard GoogLeNet implementation trained with stochastic gradient descent with mini batches. After training, phonetic rescoring is performed in the usual way to map the TIMIT phone set to the smaller standard set. Benchmark results are presented for comparison to other state-of-the-art approaches. Finally, conclusions and future directions with regard to extending the approach are discussed.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.116.49.243

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Glackin, C.; Wall, J.; Chollet, G.; Dugan, N. and Cannings, N. (2018). Convolutional Neural Networks for Phoneme Recognition. In Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-276-9; ISSN 2184-4313, SciTePress, pages 190-195. DOI: 10.5220/0006653001900195

@conference{icpram18,
author={Cornelius Glackin. and Julie Wall. and Gérard Chollet. and Nazim Dugan. and Nigel Cannings.},
title={Convolutional Neural Networks for Phoneme Recognition},
booktitle={Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2018},
pages={190-195},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006653001900195},
isbn={978-989-758-276-9},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 7th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Convolutional Neural Networks for Phoneme Recognition
SN - 978-989-758-276-9
IS - 2184-4313
AU - Glackin, C.
AU - Wall, J.
AU - Chollet, G.
AU - Dugan, N.
AU - Cannings, N.
PY - 2018
SP - 190
EP - 195
DO - 10.5220/0006653001900195
PB - SciTePress