and passive methods (Ben Amor et al. 2005).
Actives methods consist in combining an optical
sensor with a source of light, like for example Laser
scanners, sensors that use structured lights (Salvi,
2004), or profilometry (Rohr and Schrader, 1998).
Passives methods rather use one or more images like
in (Hernandez and Schmitt, 2003) or like
stereophotogrammetry (D'Apuzzo, 2002). In our
application, we have three main constraints: to
acquire the relief in conjunction with the visual
aspect of the skin, to do this with as less constraints
as possible for the experimenter and the subject, to
keep a high visual fidelity to the real skin. These
constraints thus exclude Laser-based systems,
methods that need heavy hardware, or systems that
do not acquire both the relief and texture. Finally,
the last constraint tends to exclude systems that
perform a 3D reconstruction, because this process
alters the visual quality of the acquired relief
compared to a high quality photograph for instance
(Ayoub et al. 1998).
For this module of Skin3D, we have conceived
an acquisition system on the basis of two cameras
assembled together and which are triggered in a
synchronized way. We have designed a specific
lighting system and we have used an optical sensor
to calibrate all graphic devices (cameras, screens,
video projectors, etc).
We have used two Pentax K10 reflex cameras
with a resolution of 10 megapixels and with 50mm
macro objective (ideal for taking pictures of faces).
The stereoscopic support allows us to minimize the
distance between the two cameras (the natural gap
between human eyes is about 6cm) and to maintain
parallelism between cameras (or a small converging
angle of a few degrees only). These conditions are
necessary to ensure a comfortable visualization for
the user during the stereoscopic projection without
important modifications of the original images.
Taking such stereoscopic pictures requires an
adapted lighting system. In our first test we deal
mainly with faces, so we have selected specific
lights to reveal the skin relief while removing all
shadows inherent the shape of the face. This lighting
system is compound of two HMI torches and two
Pentax AF 540 FGZ flashes.
In order to obtain the highest image fidelity both
for the acquisition and visualization, we calibrate all
graphic devices by associating them an ICC profile
(International Color Consortium). This is performed
using an i1 (X-rite) sensor and a standardized color
board.
2.2 Calibration with a Genetic
Algorithm
Camera calibration is a crucial step in stereovision
(Faugeras et al., 1987) because it will determine the
accuracy of the acquired relief. It consists in
estimating the intrinsic and extrinsic parameters of
the cameras (i.e. focal length, distortion,
rotation/translation between the two cameras, etc).
Numerous methods exist in this context (Tsai, 1987)
without a real consensus, even if some algorithms
are relatively common (Zhang, 1998). The type of
methods we have selected consist in taking pictures
of a calibration target with known dimensions, and
then to estimate the parameters that minimize a
target « reconstruction » error. These methods
involve non linear optimization procedures which
may have some problems (stability, initial starting
point). This has lead researchers to make use of
genetic and evolutionary algorithms which are
stochastic procedures with less sensitivity. In this
context, one may cite for instance (Zhang and Ji,
2001) where a single camera is calibrated, (Cerveri
et al. 2001) who use evolution strategies for
stereovision, or (Dipanda et al. 2003) who use one
camera and a Laser.
We have developed a new calibration method,
based on genetic algorithms, and which
distinguishes itself from the others on the following
points: it is specific to stereovision, it uses the notion
of distance between points in its evaluation function
(because we want to make precise measurements), it
can be applied to several models of objectives (“pin-
hole” model but also telecentric model). It proceeds
in the following way (see figure 1, step 2): we take
pictures of a target of known dimensions and with
different orientations, then we detect specific points
(corners) on this target in both left and right images.
The distances between these points are perfectly
known. The objective of our genetic algorithm is to
find the set of parameters that minimizes the
prediction error (i.e. the difference between known
and estimated distances). For this purpose, it uses a
population of individual, where each individual is a
possible set of parameters. At the beginning, the
population is filled with randomly generated
individuals where parameters belong to loosely
defined intervals. Then parents are selected
according to their performance using binary
tournament selection, and we recombine these
individuals using a crossover operator (either a
linear recombination or a discrete uniform
crossover) and using a mutation operator (small
random noise). The evaluation function takes into
HEALTHINF 2009 - International Conference on Health Informatics
370