Authors: Joseph Razik 1 ; Sébastien Paris 2 and Hervé Glotin 1

Affiliations: 1 Université du Sud Toulon-Var, France ; 2 Université Aix-Marseille, France

Keyword(s): MFCC, GMM, Sparse coding, Large-scale SVM, Explicit feature maps.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Learning in Process Automation ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: We present a novel approach to phoneme recognition that we intend to extend to a full automatic speech recognition (ASR) system. Conventional ASR systems rely on a GMM-HMM combination, which is a fully generative approach. Current discriminative methods are not tractable on large-scale data sets, especially with non-linear kernels. Our system introduces a new scheme that combines sparse coding with an approximated additive kernel for fast SVM training for phoneme recognition. On a broadcast news corpus, our system outperforms GMMs by around 2.5%, and its computational cost is linear in the number of samples.
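
To illustrate why an explicit feature map makes SVM training scale linearly with the number of samples, here is a minimal sketch. It assumes the Hellinger kernel as the additive kernel, chosen only because its feature map is exact and simple; the abstract does not say which additive kernel or approximation the authors use. The vectors `x` and `y` are hypothetical stand-ins for non-negative sparse codes.

```python
import math

def hellinger_kernel(x, y):
    """Additive kernel k(x, y) = sum_i sqrt(x_i * y_i), for non-negative inputs."""
    return sum(math.sqrt(a * b) for a, b in zip(x, y))

def hellinger_feature_map(x):
    """Explicit map psi such that <psi(x), psi(y)> equals k(x, y)."""
    return [math.sqrt(a) for a in x]

def dot(u, v):
    """Plain inner product, the only operation a linear SVM needs."""
    return sum(a * b for a, b in zip(u, v))

# Hypothetical non-negative codes (illustration only, not data from the paper).
x = [0.0, 0.5, 0.25, 0.25]
y = [0.1, 0.4, 0.5, 0.0]

kernel_value = hellinger_kernel(x, y)
linear_value = dot(hellinger_feature_map(x), hellinger_feature_map(y))
```

Because the mapped vectors reproduce the kernel through an ordinary inner product, each sample is mapped once and a linear SVM solver can be used, which avoids building the full kernel matrix over all pairs of samples.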

CC BY-NC-ND 4.0

Paper citation in several formats:
Razik, J.; Paris, S. and Glotin, H. (2012). BROADCAST NEWS PHONEME RECOGNITION BY SPARSE CODING. In Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM; ISBN 978-989-8425-99-7; ISSN 2184-4313, SciTePress, pages 191-197. DOI: 10.5220/0003778201910197

@conference{icpram12,
author={Joseph Razik and Sébastien Paris and Hervé Glotin},
title={BROADCAST NEWS PHONEME RECOGNITION BY SPARSE CODING},
booktitle={Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM},
year={2012},
pages={191-197},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003778201910197},
isbn={978-989-8425-99-7},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 1st International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM
TI - BROADCAST NEWS PHONEME RECOGNITION BY SPARSE CODING
SN - 978-989-8425-99-7
IS - 2184-4313
AU - Razik, J.
AU - Paris, S.
AU - Glotin, H.
PY - 2012
SP - 191
EP - 197
DO - 10.5220/0003778201910197
PB - SciTePress