Authors: E. Didiot ; I. Illina ; O. Mella ; D. Fohr and J.-P. Haton

Affiliation: LORIA-CNRS & INRIA Lorraine, France

Keyword(s): Speech/music discrimination, wavelets, static and dynamic parameters, long-term parameters, classifiers fusion.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: Speech/music discrimination is a challenging research problem that significantly affects Automatic Speech Recognition (ASR) performance. This paper proposes new features for the speech/music discrimination task. We propose a wavelet-based decomposition of the audio signal, which is well suited to analyzing non-stationary signals such as speech or music. We compute several types of energy in each frequency band obtained from the wavelet decomposition. Two class/non-class classifiers are used: one for speech/non-speech and one for music/non-music. On the broadcast test corpus, the proposed wavelet approach gives better results than the MFCC-based one. For instance, we obtain a significant relative improvement of 39% in the error rate for the speech/music discrimination task.
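
The idea of per-band energy features described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Haar wavelet, the three decomposition levels, the use of mean energy as the energy type, and the function names `haar_dwt_step` and `band_energies` are all assumptions chosen to keep the example dependency-free.

```python
import math


def haar_dwt_step(signal):
    """One level of the orthonormal Haar transform.

    Returns (approximation, detail) coefficient lists, each half as long
    as the input. The Haar wavelet is an assumption made for this sketch.
    """
    s = 1.0 / math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    detail = [(signal[i] - signal[i + 1]) * s for i in range(0, len(signal) - 1, 2)]
    return approx, detail


def band_energies(frame, levels=3):
    """Mean energy per wavelet band for one audio frame.

    Output order: detail bands from finest (highest frequencies) to
    coarsest, followed by the final approximation (lowest frequencies).
    Mean energy is only one possible energy type (assumption).
    """
    if levels < 1 or len(frame) % (1 << levels) != 0:
        raise ValueError("frame length must be a multiple of 2**levels")
    energies = []
    approx = list(frame)
    for _ in range(levels):
        approx, detail = haar_dwt_step(approx)
        energies.append(sum(d * d for d in detail) / len(detail))
    energies.append(sum(a * a for a in approx) / len(approx))
    return energies


# A frame that alternates sign at every sample contains only the highest
# frequencies, so its energy falls entirely in the finest detail band.
frame = [1.0, -1.0] * 4
features = band_energies(frame, levels=3)
```

In a discrimination pipeline, a vector such as `features` would be computed for every frame and passed to the speech/non-speech and music/non-music classifiers; how those classifiers are built is outside the scope of this sketch.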

CC BY-NC-ND 4.0

Paper citation in several formats:
Didiot, E.; Illina, I.; Mella, O.; Fohr, D. and Haton, J. (2006). SPEECH/MUSIC DISCRIMINATION BASED ON WAVELETS FOR BROADCAST PROGRAMS. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP; ISBN 978-972-8865-64-1, SciTePress, pages 151-156. DOI: 10.5220/0001572901510156

@conference{sigmap06,
author={E. Didiot and I. Illina and O. Mella and D. Fohr and J.{-}P. Haton},
title={SPEECH/MUSIC DISCRIMINATION BASED ON WAVELETS FOR BROADCAST PROGRAMS},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP},
year={2006},
pages={151-156},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001572901510156},
isbn={978-972-8865-64-1},
}

TY - CONF

JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP
TI - SPEECH/MUSIC DISCRIMINATION BASED ON WAVELETS FOR BROADCAST PROGRAMS
SN - 978-972-8865-64-1
AU - Didiot, E.
AU - Illina, I.
AU - Mella, O.
AU - Fohr, D.
AU - Haton, J.
PY - 2006
SP - 151
EP - 156
DO - 10.5220/0001572901510156
PB - SciTePress