loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: HariKrishna Maganti 1 and Marco Matassoni 2

Affiliations: 1 Fondazione Bruno Kessler, Italy ; 2 IRST, Italy

Keyword(s): Bio-inspired auditory processing, Gammatone filtering, Modulation spectrum, Reverberation, Automatic speech recognition.

Related Ontology Subjects/Areas/Topics: Acoustic Signal Processing ; Biomedical Engineering ; Biomedical Signal Processing ; Speech Recognition

Abstract: Mel-frequency cepstrum based features have been traditionally used for speech recognition in a number of applications, as they naturally provide a higher recognition accuracies. However, these features are not very robust in a noisy acoustic conditions. In this article, we investigate the use of bio-inspired auditory features emulating the processing performed by cochlea to improve the robustness, particularly to counter environmental reverberation. Our methodology first extracts robust noise resistant features by gammatone filtering, which emulate cochlea frequency resolution and then a long-term modulation spectral processing is performed which preserves speech intelligibility in the signal. We compare and discuss the features based upon the performance on Aurora5 meeting recorder digit task recorded with four different microphones in a hands-free mode at a real meeting room. The experimental results show that the proposed features provide considerable improvements with respect to the state of the art feature extraction techniques. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 44.221.43.88

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Maganti, H. and Matassoni, M. (2011). BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT. In Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2011) - BIOSIGNALS; ISBN 978-989-8425-35-5; ISSN 2184-4305, SciTePress, pages 51-58. DOI: 10.5220/0003145800510058

@conference{biosignals11,
author={HariKrishna Maganti. and Marco Matassoni.},
title={BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT},
booktitle={Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2011) - BIOSIGNALS},
year={2011},
pages={51-58},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003145800510058},
isbn={978-989-8425-35-5},
issn={2184-4305},
}

TY - CONF

JO - Proceedings of the International Conference on Bio-inspired Systems and Signal Processing (BIOSTEC 2011) - BIOSIGNALS
TI - BIO-INSPIRED AUDITORY PROCESSING FOR SPEECH FEATURE ENHANCEMENT
SN - 978-989-8425-35-5
IS - 2184-4305
AU - Maganti, H.
AU - Matassoni, M.
PY - 2011
SP - 51
EP - 58
DO - 10.5220/0003145800510058
PB - SciTePress