Figure 1: Face detection system
3 POLIGONAL PROJECTION
When a neural network is used in image pattern
recognition, it is necessary to extract features from
the images to feed the neural network.
Although the bitmap is often used as features to a
neural network, sometimes it may not be adequate.
Changes in size, position or rotation in the image are
likely to change most values in the bitmap, but are
unlikely to change the class of the image.
If, for example, a face image is slightly shifted to
the left, most features extracted from bitmap values
will probably change. However, that will not change
the fact that it is a face. The neural network will
have to generalize these differences in order to
achieve good results.
This paper proposes a method of feature
extraction to help in image size and position
normalization. It was previously used in (Vianna and
Rodrigues, 2000) and it is called polygonal face
projection. In (Vianna and Rodrigues, 2000), a black
and white handwritten character is placed inside of a
polygon. Then, distances between the polygon’s
sides and the first black pixel in the character are
computed. A set of these distances is used to
represent the character to the neural network. This
method, called polygonal character projection,
improves generalization in neural network character
recognition (Vianna and Rodrigues, 2000).
However, it is not possible to apply the
polygonal projection proposed in (Vianna and
Rodrigues, 2000) in neural face detection. Faces are
grayscale images, and changing it to black and white
will probably cause the loss of relevant information.
This paper proposes an adaptation on the
polygonal character projection method to allow its
use in a grayscale image, where there is no simple
method to measure the distances between the first
black pixel and the polygon’s side. A concept of
projection energy is created.
Projection energy is a number, previously
defined, that will be subtracted from image
luminance values in a certain projection direction.
Let figure 2 defines a projection direction; the
energy value will be subtracted by luminance pixels
values in the direction of arrow in figure 2.
Figure 2: Projection direction in a face.
Thus, from the border of the image in figure 2,
energy will be subtracted by luminance pixels in the
projection direction. When the luminance of a new
pixel is subtracted and the resultant energy becomes
zero or less, a distance between the initial point and
the zero point is computed, this is the projection
distance and it is considered a feature extracted from
the original face.
Polygonal projection with energy concept is
related to x-ray feature extraction used in medicine.
In this case, an x-ray emitter will sensibilize a
special film according to blocking characteristics in
the objects. Bones, for example, usually block x-ray
emission, making the film white.
In a polygonal face projection, higher values of
luminance will block projection and result in lower
distance values. On the other hand, if only lower
luminance pixels are found in the projection
direction, the distance extracted will be higher.
In the case of face detection, inverting the image
before extracting distances using polygonal
projection could lead to better results. As a result of
image inversion, black areas will block projection
and white areas will not.
The motivation for the inversion of the image
can be seen in figure 2. The eyes’ position is likely
to be darker than the rest of face image, which
facilitates their detection, since it will probably
block projection. The mouth and nose area also
likely to be darker than the rest of the face image.
Image inversion will make face features such as
eyes, mouth and nose to be detected by polygonal
projection, because it will block projection.
Detecting eyes, mouth and nose position is an
important step to successfully detect a face.
Another adaptation in the polygonal projection
method is to square every element in the original
image before projection. Squaring elements will
reduce very much the values in the range between 0
and 0.5, and will prevent that sequences of lower
luminance values reduce energy. Only values higher
than 0.5 will continue to block projection.
Squaring elements in the original image is also
related to the way x-ray emission is exponentially
attenuated by objects (Jain, 1989).
As in (Vianna and Rodrigues, 2000), choosing
the polygon defines all projection directions. In this
paper, a square will be used as the base polygon;
distances will then be extracted orthogonally to the
square sides, as show in figure 3.
ICEIS 2005 - ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS
420