(
(
(
1
(, )Gxy= Crx,y+hx,y+Lx,y∇⎡ ⎤
⎣⎦
() ()() ()
2
(, ) 3G x y = L x, y Cr x, y S x, y h x, y∇−−−∗⎡⎤
⎣⎦
(1)
(2)
the vertical passing by P
3
. As said previously, the
difficulty of the task is that we can find between lips
different areas with similar or largely different color,
texture or luminance than lips, when a mouth is
open. The main goal is to find the adequate
information that can emphasize the inner contour for
every configuration. Experimental study on
thousands of face images has shown that no single
data can reach this goal and we have to consider a
combination between the information coming from
different spaces, each information emphasizing the
boundary for one specific configuration. For
example, lips are represented by a high pseudo-hue
and a high red component, teeth are bright and
saturated in color, the oral cavity is very dark, when
gums and tongue could have the same aspect than
lips. We build experimentally two gradients (G
1
and
G
2
) of mixed information coming from different
spaces to find P
7
and P
8
.
P
7
is found by searching
the maximum of the gradient G
1
(see equation 1)
between P
3
and P
6
. P
8
is found by searching the
maximum of the gradient G
2
(see equation 2)
between P
3
and P
7
. In order to avoid false detection
due to noise, we cumulate the different gradients on
10 columns around P
3
and we choose the point with
the highest cumulated gradient.
where Cr comes from the YCbCr space, h
is the
pseudo-hue, L is the luminance and S is the
saturation component of the HSV space. Each
component is normalized between 0 and 1. The
pseudo-hue, introduced by Hulbert et al. (Hulbert,
1998), is the ratio h = R/R+G, where R and G are the
red and green components of the RGB color space.
The pseudo-hue emphasizes contrast between lips
and skin (Eveno, 2004).
From P
8
and P
7
, we compute two seeds P’
8
and
P’
7
for the initialization of the jumping snakes. P’
8
is
¾ of the segment [P
3
P
8
] and P’
7
is ¾ of the segment
[P
6
P
7
] (see figure 5). With this configuration, the
seeds are closer to the inner contours than eventual
noise contours.
Figure 5: Detection of jumping snake seeds.
For the convergence of the snakes, we have also
to find gradients which emphasize the inner
boundary in every configuration. In the same way,
we experimentally build two kinds of space
combination. For the upper inner contour, the
convergence of the first jumping snake gives the
initial contour C
2.
P’
8
is taken as seed and the snake
parameters are chosen so that the two snake’s
branches tend to go down. G
3
(see equation 3) is the
gradient used for the snake’s growth phase. For the
lower inner contour, the convergence of the second
jumping snake gives the initial contour C
3.
P’
7
is
taken as seed and the snake parameters are chosen so
that the two snake’s branches tend to go up. G
4
(see
equation 4) is the gradient used for the snake’s
growth phase (see figure 6).
(3)
(4)
where R is the red component of the RGB space,
L is the luminance, u comes from the CIELuv space
(Wyszecki, 1982) and h is the pseudo-hue. Each
component is normalized between 0 and 1.
These 2 gradient definitions were chosen
because:
− the luminance L and the pseudo-hue h are
generally higher for the lips than inside the
mouth (in particular than the oral cavity, where
L and h are close to zero),
− the component u is higher for the lips than for
the teeth (indeed u is close to zero for the teeth)
− and the component R can be lower for the lips
than inside the mouth in others cases.
The sign is different between G
3
and G
4
because the
lips are above the inside of the mouth with G
3
,
whereas the lips are below the inside of the mouth
with G
4
.
We take the two closest points (P’’
8
and P’’
7
) to
the vertical passing by P
3
on each contour C
2
and C
3
as key points for our inner lip model.
Figure 6: Jumping snake convergences and detection of
key points.
)()()
3
(, )G x y = R x,y u x,y h x,y∇−−
⎤
⎦
4
(, )G xy= Lx,y+ux,y+hx,y∇
⎤
⎦
VISAPP 2008 - International Conference on Computer Vision Theory and Applications
300