2 STATE OF THE ART
Different algorithms have been described in the lit-
erature extracting the twodimensional position of ob-
jects from SEM images for automation purposes. The
performance of these algorithms is good, making
first simple automation scenarios possible. The ap-
proaches used for twodimensional tracking on SEM
images and their possible extensions to 3D-tracking
will be summarized in the following.
One of the first and most simple approaches used
template matching as the basis of the algorithm (Siev-
ers and Fatikow, 2006). A template image is extracted
or loaded which contains the object to be tracked. The
template image is then cross-correlated with a search
area in the input image. The maximum value of the re-
sulting array is in the place where the template is most
likely to be found. Due to the use of cross correlation,
this approach is very robust against additive noise,
which is an advantage especially for fast scanned
SEM images. Problems of this approach are that the
algorithm is sensitive against certain changes in the
object appearance which may occur during handling
processes. Examples for these appearance changes
are rotation of the object, scaling of the object due to
magnification changes and partial occlusion by other
objects in the setup. Removing these weaknesses for
this method comes with increased computational ef-
fort so that a fast enough calculation is not always
possible. Extraction of the z-position is not featured
and cannot easily be added.
If instead of a template image a CAD model of
the object is available, it is possible to use rigid body
models to track the object in the SEM image (Kra-
tochvil et al., 2007). The implementation uses mea-
surement lines orthogonal to the model edges and
tries to fit the model to visible edges in the image.
Model edges which should be invisible are identified
and not used for the pose estimation. Though edge de-
tection is difficult in noisy SEM images, the approach
yields good results using advanced techniques for dis-
carding or outweighting false edges and through the
high number of measurement lines used. When three-
dimensional CAD-models are used, it is possible to
recover the threedimensional pose including in-plane
and out-of-plane rotations, except the z-position. The
extension for true threedimensional tracking relies on
a model of the SEM image projection to yield the z-
component of the position. This seems to be working
for low magnifications.
Another possibility is the use of active contours
or snakes (for details about this concept see (Blake
and Isard, 2000)), which do not rely on much pre-
existing knowledge about the object. Active contours
are parametrized curves in twodimensional space, that
means in the image plane. After coarse initialization
the contour is evolving to segment the object from the
scene. The contours are coupled with an energy func-
tion dependent on their shape or appearance, and on
the image data. This energy function is being mini-
mized by moving contour points or the contour as a
whole. The part dependent on the contour is called
internal energy, the part dependent on the image data
is the external energy. In the original formulation,
the external energy function was defined to be depen-
dent on the distance of the contour from edges in the
image, as explained in (Kass et al., 1988). For the
use with noisy SEM images, a region-based approach
(see (Sievers, 2006) and (Sievers, 2007)) has shown
to be useful. The external energy function here is de-
pendent on the region statistics and the noise charac-
teristics of the imaging source. The goal is to max-
imize the compound probability of the enclosed re-
gion. This approach has proven to be very robust to
additive noise, and is inherently robust against scaling
and rotation. If the contour minimization is restricted
to the euclidean transform space, robustness against
partial occlusion is added. Due to the model-free na-
ture of this approach, threedimensional tracking is not
immediately possible, but the coupling with focus-
based methods is principally possible and shows first
promising results in the SEM.
In this paper, the last tracking approach is taken
as a basis, and extended to use defocus analysis for
depth estimation. The extracted information is only
the z-position of the tracked object, without any struc-
tural information about the object. For the recovery of
threedimensional structure of objects, different meth-
ods may be used (see e.g. (Fernandez et al., 2006) or
(J
¨
ahnisch and Fatikow, 2007)).
3 PRINCIPLE
The principle of the twodimensional tracking has
been explained already in (Sievers, 2006). The impor-
tant aspect is that the active contour algorithm does
not only deliver the position information of the ob-
ject, but at the same time calculates a segmentation
of the object from the rest of the scene. This enables
further analysis of the enclosed object.
Due to the working principle of the SEM, only a
certain range around the set working distance is de-
picted sharp. Though this range is quite big in com-
parison to optical microscopes, defocusing is still evi-
dent, as can be seen in figure 2. The defocusing in the
SEM has been used already in (Eichhorn et al., 2008)
to determine the z-position of objects by generating
THREEDIMENSIONAL TRACKING USING OBJECT DEFOCUS - In Twodimensional Scanning Electron Microscope
Images
73