space to the physical model space. The word space
and the physical model space are made for each
facial element, and the mapping is executed for each
facial element respectively.
Before synthesizing facial elements, the words
expressing their shape (they are called the feature
words) are collected from a sentence describing
facial elements or testimony of a witness. This part
is not included in this paper. A model corresponding
to an extracted feature word is provided through
mapping with respect to individual facial elements,
and then a human face is synthesized through
combining all physical models of facial elements
together.
3 PROPOSED METHOD
We define the process where a human face is
synthesized based on the word information as a
mapping from a word space that is organized with
the words expressing the dimension and shape of
facial elements to a physical model space where
physical shape of the facial elements are concretely
formed. In this section, we explain the both spaces
and the mapping function from the word space to the
physical model space.
3.1 Word Space
Many feature words were collected with respect to
individual facial elements, e.g. mouth, nose, eyes,
eye-brows, cheeks, jaw, and profile, from a Japanese
dictionary (Kindaichi, 2001). Then, every word
space with respect to individual facial elements is
formed. The procedure of forming the word space is
as follows. First of all, a similarity matrix among the
feature words is obtained from an experiment using
subjects, based on the similarity of the shape
recalled by the words. Second, a spatial layout of the
feature words is obtained by inputting the similarity
matrix into Multi-Dimensional Scaling method
(MDS). This spatial layout of the feature words
based on the similarity matrix of the feature words is
the word space. Figure 2 is the word space of nose.
The figure is projected on a two dimensional plane
for recognizing it visually. The word space is
characterized by the following facts;
(1) The origin is neutral, the farther away a feature
word is from the origin, the greater the
characteristic of the feature word.
(2) Similar words are arranged close together, while
dissimilar words are arranged further away from
each other.
(3) Every word space has six dimensions. This is
determined based on an indicator called "stress,"
which shows how the distance relationship in the
word space satisfies the similarity relationship
between the feature words.
(4) A feature word W
i
in the word space is described
as follows;
12 6
( , ,..., )
i
ww w
W , (1)
i=1, . . . , the number of feature words
3.2 Physical Model Space
Concrete shapes of the facial elements on 3-
dimensional computer graphics (CG) are determined
by xyz coordinates of apexes of a wire frame model.
Although the number of apexes of each facial
element is different from each other, a set of xyz
coordinates of all the apexes in the wire frame model
becomes the parameters of the physical model space.
A physical model M
i
of a facial element is described
as following;
12
(, ,..., )
iii in
MPP P (2)
here, n is the number of apexes of each facial
element. P
ij
is jth apex of a wire frame model.
(, ,)
ij ij ij ij
yz
P , j = 1, . . . , n (3)
3.3 Mapping Function
Several feature words are extracted as training data
from the word space equally in space for each
individual facial element respectively, and the wire
frame models corresponding to the extracted feature
words are built up by means of a CG tool. Table 1
?????
?????
???
????
?????
???????
???????
????
???
??
????
?????
????
???
???
???
????
???
????
?????
???
??????
?????
???
??
???????
????
???
???
??????
?????
???
???????
????
??????
???????
??????
?????
??????
?????
?????
????
?????
?????
???????
?????
??????
?
??
??????????
??????
?????
???
lump
low nose
small nose
short nose
u
ward
thin
long nose
high nose
downward nose
crooked nose
ig nose
fat nose
Figure 2: Word space and feature words as training data in
the case of nose. Since we can not find the correct English
words corresponding to all Japanese feature words, we
display the word space in Japanese except the training data.
GRAPP 2007 - International Conference on Computer Graphics Theory and Applications
222