scene given a set of images. Seen the challenging data
set on which we rely for our reconstruction project,
we concentrated in this work on an experimental pro-
cess for the choice of the best features that we will fur-
ther analyze. We dispose of the following character-
istics in our postcard collection: (1) Data are sparse,
Figure 2: Illustration of the challenging data collection.
and few images are available for the same time pe-
riod. (2) Images are printed and not digital docu-
ments amongst some represent photos that were hand-
colored by archaeologists. (3) Text, stamps and post-
marks can layover the postcards.We can comprehend
from figure 2 the challenges of our data collection.
This paper will be organized as follows. In the
section two we present the evaluated detectors and de-
scriptors. The experimental design is described in de-
tails in the third section. The fourth section shows the
different results obtained and a detailed discussion of
their impact on our application. We finish by a con-
clusion and some further perspectives for the evolu-
tion of our project.
2 FEATURE COMPARISON
In the image analysis step, a feature detection method
is used for the localization of interest points that rep-
resent invariant location with respect to geometric and
photometric transformations. The distinctiveness of
the detected interest point is indexed through a de-
scriptor vector that holds the information content in
the local region centered at the interest point.
Several comparative studies of local region detec-
tors have been presented in the literature. (Miko-
lajczyk and Schmid, 2005) extract affine invariant
regions using different detectors and then compare
different description methods. (Moreels and Peron,
2007) compare several interest point detectors and de-
scriptors to match 3D objects features across view
points and lighting conditions. Other authors compare
them in the context of visual SLAM(vision-based
simultaneous localization and mapping) (Gil et al.,
2010), historic repeat photography (Gat et al., 2011)
or real-time visual tracking (Gauglitz et al., 2011).
We evaluate four different detectors and three
descriptors that were tested in the literature. The
choice was made relatively to our data set applica-
tion. Reckon with the quality of the images we have
selected the detectors that calculate a dense set of effi-
cient features and some relevant descriptors that per-
formed a robust matching.
2.1 Interest Point Detectors
Harris Laplace (Mikolajczyk and Schmid, 2005).
This approach detects corner-like points that are in-
variant to similarity group transformations. They are
detected using a scale-adapted Harris function, then
selected in scale-space by the Laplacian-of-Gaussian
operator.
Hessian Laplace (Mikolajczyk and Schmid, 2005).
This approach detects blob-like structures that are in-
variant to similarity group transformations. Points are
localized in space at the local maxima of the Hessian
determinant and in scale at the local maxima of the
Laplacian-of-Gaussian operator.
Harris Affine (resp. Hessian Affine) (Mikolajczyk
and Schmid, 2005). These detectors are invariant to
affine transformations. The interest points are com-
puted using the Harris Laplace detector (resp. Hes-
sian Laplace detector) then an affine neighborhood is
determined by the affine adaptation process based on
the second moment matrix.
SIFT (Scale Invariant Feature Transform) (Lowe,
2004; Younes et al., 2012). This detector is invari-
ant to affine transformations. It detects distinctive
points using a difference of Gaussian function (DoG)
applied in scale space. Points are selected as local
extrema of the DoG function, while low contrasted
points and points localized on low curvature contours
are rejected.
2.2 Local Descriptors
Steerable Filters (Mikolajczyk and Schmid, 2005).
Designing steerable filters consists in computing up
to 4th order derivatives of a Gaussian function. Cor-
relations between rotated version of the filters with
the image leads to a 14-dimensional descriptor.
PCA-SIFT (Ke and Sukthankar, 2004; Mikolajczyk
and Schmid, 2005). This descriptor is based on a
SIFT-like descriptor on which a PCA (Principal Com-
ponent Analysis) is applied. To compute the 36 di-
mensional vector corresponding to this descriptor, x
and y gradient images are computed in a support re-
gion, sampled at 39×39 locations and then reduced
by PCA.
SIFT (Lowe, 2004; Younes et al., 2012). This de-
scriptor assigns a dominant orientation to each feature
point based on local image gradient directions. The
descriptor is deduced from orientation histograms
VISAPP2013-InternationalConferenceonComputerVisionTheoryandApplications
482