loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Patricia Besson and Murat Kunt

Affiliation: Signal Processing Institute (ITS), Ecole Polytechnique Fédérale de Lausanne (EPFL), Switzerland

Abstract: This work addresses the problem of detecting the speaker on audio-visual sequences by evaluating the synchrony between the audio and video signals. Prior to the classification, an information theoretic framework is applied to extract optimized audio features using video information. The classification step is then defined through a hypothesis testing framework so as to get confidence levels associated to the classifier outputs. Such an approach allows to evaluate the whole classification process efficiency, and in particular, to evaluate the advantage of performing or not the feature extraction. As a result, it is shown that introducing a feature extraction step prior to the classification increases the ability of the classifier to produce good relative instance scores.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.224.53.202

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Besson, P. and Kunt, M. (2006). Hypothesis Testing as a Performance Evaluation Method for Multimodal Speaker Detection. In Proceedings of the 2nd International Workshop on Biosignal Processing and Classification (ICINCO 2006) - BPC; ISBN 978-972-8865-67-2, SciTePress, pages 106-115. DOI: 10.5220/0001224701060115

@conference{bpc06,
author={Patricia Besson. and Murat Kunt.},
title={Hypothesis Testing as a Performance Evaluation Method for Multimodal Speaker Detection},
booktitle={Proceedings of the 2nd International Workshop on Biosignal Processing and Classification (ICINCO 2006) - BPC},
year={2006},
pages={106-115},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001224701060115},
isbn={978-972-8865-67-2},
}

TY - CONF

JO - Proceedings of the 2nd International Workshop on Biosignal Processing and Classification (ICINCO 2006) - BPC
TI - Hypothesis Testing as a Performance Evaluation Method for Multimodal Speaker Detection
SN - 978-972-8865-67-2
AU - Besson, P.
AU - Kunt, M.
PY - 2006
SP - 106
EP - 115
DO - 10.5220/0001224701060115
PB - SciTePress