Authors:
Alexander Usoltsev
;
Dijana Petrovska-Delacrétaz
and
Khemiri Houssemeddine
Affiliation:
Université Paris-Saclay, France
Keyword(s):
Biometrics, Full Video Processing, Face, Speech, Score Fusion.
Related
Ontology
Subjects/Areas/Topics:
Applications
;
Biomedical Engineering
;
Biomedical Signal Processing
;
Biometrics
;
Biometrics and Pattern Recognition
;
Cardiovascular Imaging and Cardiography
;
Cardiovascular Technologies
;
Computer Vision, Visualization and Computer Graphics
;
Health Engineering and Technology Applications
;
Image and Video Analysis
;
Multimedia
;
Multimedia Signal Processing
;
Pattern Recognition
;
Signal Processing
;
Software Engineering
;
Telecommunications
;
Video Analysis
Abstract:
This paper describes a bi-modal biometric verification system based on voice and face modalities, which takes advantage of the full video processing instead of using still-images. The bi-modal system is evaluated on the MOBIO corpus and results show a relative improvement of performance by nearly 10% when the whole video is used. The fusion between face and speaker verification systems, using linear logistic regression weights, gives a relative improvement of performance that varies between 30% and 60% comparing to the best uni-modal system. Proof-of-concept iPad application is developed based on the proposed bi-modal system.