Audiovisual Data Fusion for Successive Speakers Tracking

Quentin Labourey; Olivier Aycard; Denis Pellerin; Michele Rombaut

Topics: Event and Human Activity Recognition; Machine Learning Technologies for Vision; Object Detection and Localization; Visual Attention and Image Saliency

In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, pages 696-701, 2014, Lisbon, Portugal

Authors: Quentin Labourey ¹; Olivier Aycard ²; Denis Pellerin ³ and Michele Rombaut ³

Affiliations: ¹ LIG and GIPSA-lab, France; ² LIG, France; ³ GIPSA-lab, France

Keyword(s): Audiovisual Data Fusion, Skin Detection, Sound Source Tracking, Talking Face Tracking.

Related Ontology Subjects/Areas/Topics: Computer Vision, Visualization and Computer Graphics ; Image and Video Analysis ; Visual Attention and Image Saliency

Abstract: This paper presents a method for tracking human speakers from audio and video data, applied to conversation tracking with a robot. Audiovisual data fusion is performed in a two-step process. First, detection is performed independently on each modality: face detection based on skin color on the video data, and sound source localization based on the time delay of arrival on the audio data. The results of these detection processes are then fused using an adaptation of the Bayesian filter to detect the speaker. The robot is able to detect the face of the person talking and to detect a new speaker in a conversation.
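
To make the two-step idea in the abstract concrete, here is a minimal sketch of a discrete Bayes filter that fuses one audio detection and a set of face detections per frame. It is not the authors' implementation: the abstract does not describe their adaptation of the Bayesian filter, so everything below is an assumption made for illustration. In particular, the azimuth is assumed to be discretised into 36 bins, both detectors are assumed to report bin indices, the two modalities are assumed conditionally independent, and all function names and parameters (`predict`, `update`, `track`, `stay`, `jump`, `floor`) are hypothetical.

```python
import math


def gaussian_likelihood(n_bins, center, sigma):
    """Unnormalised Gaussian bump over bin indices, centred on `center`."""
    return [math.exp(-0.5 * ((i - center) / sigma) ** 2) for i in range(n_bins)]


def predict(belief, stay=0.8, jump=0.05):
    """Prediction step (assumed motion model).

    The speaker mostly stays in the same bin, sometimes drifts to a
    neighbouring bin, and with probability `jump` a new speaker takes
    over anywhere in the field of view.
    """
    n = len(belief)
    move = (1.0 - stay) / 2.0
    diffused = [0.0] * n
    for i, b in enumerate(belief):
        diffused[i] += stay * b
        diffused[max(i - 1, 0)] += move * b
        diffused[min(i + 1, n - 1)] += move * b
    return [(1.0 - jump) * d + jump / n for d in diffused]


def update(belief, audio_lik, video_lik, floor=1e-3):
    """Update step: multiply the prior by both likelihoods, then normalise.

    `floor` keeps the posterior well defined when a detector returns nothing.
    """
    post = [b * (a + floor) * (v + floor)
            for b, a, v in zip(belief, audio_lik, video_lik)]
    total = sum(post)
    return [p / total for p in post]


def track(observations, n_bins=36):
    """Return the most probable speaker bin for each frame.

    `observations` is a list of (audio_bin, face_bins) pairs: the bin of the
    localised sound source and the bins of all detected faces.
    """
    belief = [1.0 / n_bins] * n_bins  # uniform prior over azimuth bins
    estimates = []
    for audio_bin, face_bins in observations:
        belief = predict(belief)
        audio = gaussian_likelihood(n_bins, audio_bin, sigma=3.0)
        video = [0.0] * n_bins
        for face_bin in face_bins:
            bump = gaussian_likelihood(n_bins, face_bin, sigma=1.0)
            video = [max(v, g) for v, g in zip(video, bump)]
        belief = update(belief, audio, video)
        estimates.append(max(range(n_bins), key=belief.__getitem__))
    return estimates


if __name__ == "__main__":
    # Two faces at bins 10 and 25; the sound source moves from near the
    # first face to near the second one after five frames.
    frames = [(11, [10, 25])] * 5 + [(24, [10, 25])] * 5
    print(track(frames))
```

In this sketch the audio likelihood is deliberately broader than the video one, reflecting the assumption that sound localization is coarser than face detection, so the fused estimate snaps to the face nearest the sound source. The `jump` term is one simple way to let the belief recover quickly when a new speaker starts talking; other choices are possible.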

CC BY-NC-ND 4.0

Paper citation in several formats:

Labourey, Q., Aycard, O., Pellerin, D. and Rombaut, M. (2014). Audiovisual Data Fusion for Successive Speakers Tracking. In Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP; ISBN 978-989-758-003-1; ISSN 2184-4321, SciTePress, pages 696-701. DOI: 10.5220/0004852506960701

@conference{visapp14,
author={Quentin Labourey and Olivier Aycard and Denis Pellerin and Michele Rombaut},
title={Audiovisual Data Fusion for Successive Speakers Tracking},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP},
year={2014},
pages={696-701},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004852506960701},
isbn={978-989-758-003-1},
issn={2184-4321},
}

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications (VISIGRAPP 2014) - Volume 2: VISAPP
TI - Audiovisual Data Fusion for Successive Speakers Tracking
SN - 978-989-758-003-1
IS - 2184-4321
AU - Labourey, Q.
AU - Aycard, O.
AU - Pellerin, D.
AU - Rombaut, M.
PY - 2014
SP - 696
EP - 701
DO - 10.5220/0004852506960701
PB - SciTePress