Authors:
Islem Jarraya; Salah Werda and Walid Mahdi
Affiliation:
University of Sfax, Tunisia
Keyword(s):
Lip Localization, Geometric Lip Model, Lip Tracking, Lip Descriptors Extraction, Viseme Classification and Recognition.
Related Ontology Subjects/Areas/Topics:
Image and Video Processing, Compression and Segmentation; Multimedia; Multimedia Signal Processing; Telecommunications
Abstract:
Automatic lip-reading is a technology that helps in understanding messages exchanged in noisy environments or in cases of hearing impairment in the elderly. Building such a system requires three subsystems: a lip localization and tracking system, a labial descriptor extraction system, and a classification and speech recognition system. In this work, we present a spatio-temporal approach to track and characterize lip movements for the automatic recognition of visemes of the French language. First, we segment the lips using color information and a geometric lip model. Then, we apply a particle filter to track lip movements. Finally, we extract and classify the visual information to recognize the pronounced viseme. This approach is applied to multiple speakers in natural conditions.
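
As an illustration of the tracking step only, the following is a minimal sketch of one bootstrap particle filter iteration in Python. The abstract does not specify the state vector, the motion model, or the likelihood, so everything here is an assumption made for illustration: the state is a 2D lip-centre position, the motion model is a Gaussian random walk, the measurement is a hypothetical centroid of a colour-segmented lip region, and the names `particle_filter_step` and `estimate` are invented for this sketch. It is not the authors' implementation.

```python
import math
import random


def particle_filter_step(particles, measurement, motion_std=2.0, meas_std=5.0, rng=random):
    """One predict/update/resample step of a bootstrap particle filter.

    particles:   list of (x, y) hypotheses for the lip-centre position (assumed state).
    measurement: observed (x, y), e.g. the centroid of a segmented lip region (assumed).
    """
    # Predict: diffuse each particle with a Gaussian random-walk motion model (assumption).
    predicted = [
        (x + rng.gauss(0.0, motion_std), y + rng.gauss(0.0, motion_std))
        for x, y in particles
    ]

    # Update: weight each particle by an isotropic Gaussian likelihood of the measurement.
    mx, my = measurement
    weights = [
        math.exp(-((x - mx) ** 2 + (y - my) ** 2) / (2.0 * meas_std ** 2))
        for x, y in predicted
    ]
    if sum(weights) == 0.0:
        # Every weight underflowed to zero; fall back to uniform weights.
        weights = [1.0] * len(predicted)

    # Resample: draw the same number of particles with probability proportional to weight.
    return rng.choices(predicted, weights=weights, k=len(predicted))


def estimate(particles):
    """Point estimate of the tracked position: the mean of the particle set."""
    n = len(particles)
    return (sum(x for x, _ in particles) / n, sum(y for _, y in particles) / n)
```

In use, the particle set would be initialised around the lip region found by the segmentation step, and `particle_filter_step` would be called once per video frame with that frame's measurement; `estimate` then gives the tracked position for the frame.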