loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Tarig Ballal ; Nedelko Grbic and Abbas Mohammed

Affiliation: Blekinge Institute of Technology, Sweden

Keyword(s): BSS, blind source separation, speech enhancement, speech analysis, speech synthesis.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: In this paper we exploit the amplitude diversity provided by two sensors to achieve blind separation of two speech sources. We propose a simple and highly computationally efficient method for separating sources that are W-disjoint orthogonal (W-DO), that are sources whose time-frequency representations are disjoint sets. The Degenerate Unmixing and Estimation Technique (DUET), a powerful and efficient method that exploits the W-disjoint orthogonality property, requires extensive computations for maximum likehood parameter learning. Our proposed method avoids all the computations required for parameters estimation by assuming that the sources are "cross high-low diverse (CH-LD)", an assumption that is explained later and that can be satisfied exploiting the sensors settings/directions. With this assumption and the W-disjoint orthogonality property, two binary time-frequency masks that can extract the original sources from one of the two mixtures, can be constructed directly from the a mplitude ratios of the time-frequency points of the two mixtures. The method works very well when tested with both artificial and real mixtures. Its performance is comparable to DUET, and it requires only 2% of the computations required by the DUET method. Moreover, it is free of convergence problems that lead to poor SIR ratios in the first parts of the signals. As with all binary masking approaches, the method suffers from artifacts that appear in the output signals. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 54.82.44.149

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Ballal, T.; Grbic, N. and Mohammed, A. (2006). A SIMPLE AND COMUTATIONALLY EFFICIENT ALGORITHM FOR REAL-TIME BLIND SOURCE SEPARATION OF SPEECH MIXTURES. In Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP; ISBN 978-972-8865-64-1, SciTePress, pages 105-109. DOI: 10.5220/0001571901050109

@conference{sigmap06,
author={Tarig Ballal. and Nedelko Grbic. and Abbas Mohammed.},
title={A SIMPLE AND COMUTATIONALLY EFFICIENT ALGORITHM FOR REAL-TIME BLIND SOURCE SEPARATION OF SPEECH MIXTURES},
booktitle={Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP},
year={2006},
pages={105-109},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001571901050109},
isbn={978-972-8865-64-1},
}

TY - CONF

JO - Proceedings of the International Conference on Signal Processing and Multimedia Applications (ICETE 2006) - SIGMAP
TI - A SIMPLE AND COMUTATIONALLY EFFICIENT ALGORITHM FOR REAL-TIME BLIND SOURCE SEPARATION OF SPEECH MIXTURES
SN - 978-972-8865-64-1
AU - Ballal, T.
AU - Grbic, N.
AU - Mohammed, A.
PY - 2006
SP - 105
EP - 109
DO - 10.5220/0001571901050109
PB - SciTePress