loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Adnan Firoze ; M. Shamsul Arifin ; Ryana Quadir and Rashedur M. Rahman

Affiliation: North South University, Bangladesh

ISBN: 978-989-8425-54-6

ISSN: 2184-4992

Keyword(s): Speech Recognition, Spectrogram, Fuzzy Logic, STFT, Standard Deviation, Segmentation.

Related Ontology Subjects/Areas/Topics: Advanced Applications of Fuzzy Logic ; Applications of Expert Systems ; Artificial Intelligence and Decision Support Systems ; Enterprise Information Systems ; Information Systems Analysis and Specification ; Tools, Techniques and Methodologies for System Development

Abstract: The paper presents Bangla word speech recognition using spectral analysis and fuzzy logic. As human speech is imprecise and ambiguous, the fuzzy logic – the base of which is indeed linguistic ambiguity, could serve as a more precise tool for analysing and recognizing human speech. Even though the core source of an uttered word is a voiced signal, our system revolves around the visual representation of voiced signals – the spectrogram. The spectrogram may be perceived as a “visual” entity. The essences of a spectrogram are matrices that include information about properties of a sound, e.g., energy, frequency and time. In this research the spectral analysis has been chosen as opposed to image processing for increased accuracy. The decision making process of our system is based on fuzzy logic. Experimental results demonstrate that our system is 80% accurate compared to a commercial Hidden Markov Model (HMM) based speech recognizer that shows 73% accuracy on an average.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.207.254.88

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Firoze, A.; Arifin, M.; Quadir, R. and Rahman, R. (2011). BANGLA ISOLATED WORD SPEECH RECOGNITION.In Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-54-6, ISSN 2184-4992, pages 73-82. DOI: 10.5220/0003492700730082

@conference{iceis11,
author={Adnan Firoze. and M. Shamsul Arifin. and Ryana Quadir. and Rashedur M. Rahman.},
title={BANGLA ISOLATED WORD SPEECH RECOGNITION},
booktitle={Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2011},
pages={73-82},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003492700730082},
isbn={978-989-8425-54-6},
}

TY - CONF

JO - Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - BANGLA ISOLATED WORD SPEECH RECOGNITION
SN - 978-989-8425-54-6
AU - Firoze, A.
AU - Arifin, M.
AU - Quadir, R.
AU - Rahman, R.
PY - 2011
SP - 73
EP - 82
DO - 10.5220/0003492700730082

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.